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ABSTRACT 



This report uses data from the third follow-up of the 
National Education Longitudinal Study of 1988 and the Beginning Postsecondary 
Students Longitudinal Study to project post secondary outcomes of the high 
school class of 1992. Specifically, the study tracked, over a four-year 
period, the paths of college-qualified students who first enrolled in a 
four-year college or university in 1992-93. Major findings included: (1) 

low- income students and Black or Hispanic students were less likely than 
high- income students or White and Asian students to be at least minimally 
qualified for college; (2) there were relatively few differences among income 
groups in four-year persistence and attainment rates; (3) race-ethnicity was 
associated with persistence and attainment for college-qualified students 
independent of qualification level; and (4) approximately 33 to 34 percent of 
highly college-qualified Whites and Hispanics earned bachelor's degrees 
compared with 14 percent of similarly qualified Black students. Following an 
introduction, data on characteristics of graduates qualified for college and 
enrollment are presented. In the next section, tables and data summarize 
persistence, nondegree departure, degree attainment, individual 
characteristics associated with attainment and persistence, and student debt. 
Discussion of controlling- related variables and a summary complete the 
report. Appendices provide a glossary and methodology details. (DB) 
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Highlights 



This report uses data combined from the third follow-up of the National Education Longi- 
tudinal Study of 1988 (NELS:88/94) and the Beginning Postsecondary Students Longitudinal 
Study (BPS: 89/90) to project the postsecondary outcomes of the high school class of 1992. Spe- 
cifically, it tracks, over a 4-year period, the expected path of college-qualified students who first 
enrolled in a 4-year college or university in the 1992-93 academic year. The focus of the report 
is the attainment and persistence rates for a cohort of high school graduates who had completed 
the necessary steps in high school to be prepared to enter a 4-year college or university. An ear- 
lier study' had suggested that if high school students took these steps, overall disparities in en- 
rollment rates by race-ethnicity and family income diminished. This report uses this cohort of 
students to estimate any differences in college outcomes 4 years after entry. The major findings 
of the study were as follows: 

• Low-income students were less likely than high-income students to be at least mini- 
mally qualified for college. White and Asian high school graduates were more likely 
than their black or Hispanic peers to be at least minimally qualified. 

• Once enrolled in a 4-year institution, while high-income students were more likely to 
receive a bachelor’s degree than middle-income students, there were no other differ- 
ences among income groups in their 4-year persistence and attainment rates. 

• Race-ethnicity did have an association with persistence and attainment for college- 
qualified students. College-qualified black students were less likely than white stu- 
dents to have earned a bachelor’s degree 4 years after entering a 4-year institution. 
This was true not only for black students who were not just minimally qualified, but 
also true for those who were highly or very highly qualified for college when they first 
entered. 

• Approximately 33 to 34 percent of highly or very highly college-qualified whites and 
Hispanics had earned bachelor’s degrees compared with 14 percent of blacks. After 4 
years, highly or very highly college-qualified blacks were also more likely not to be 
enrolled and not to have earned a degree — 46 percent of blacks compared with 2 1 per- 
cent of whites. 



*L. Berkner and L. Chavez, Access to Postsecondary Education for the 1992 High School Graduates (NCES 98-105) (Washing- 
ton, D.C.: U.S. Department of Education, National Center for Education Statistics, 1997). 
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Preface 



This report is part of the Postsecondary Education Descriptive Analysis Reports (PEDAR) 
series. The PEDAR series consists of reports that focus on postsecondary education policy is- 
sues, taking advantage of a variety of education data sources, especially recently completed data 
collections. Other reports in the series include Access to Postsecondary Education for the 1992 
High School Graduates (NCES 98-105); Confronting the Odds: Students at Risk and the Pipe- 
line to Higher Education (NCES 98-094); and Postsecondary Financing Strategies: How Un- 
dergraduates Combine Work, Borrowing, and Attendance (NCES 98-088). 

This report examines the attainment and persistence rates of a cohort of high school gradu- 
ates who had completed the necessary steps in high school to be prepared to enter a 4-year col- 
lege or university. An earher study had suggested that if high school students took these steps, 
overall disparities in enrollment rates by race-ethnicity and family income diminished. This re- 
port uses this cohort of students to estimate any differences in college outcomes 4 years after en- 
try. 



This analysis used merged data from 1) the National Education Longitudinal Study of 1988 
(NELS:88/94), a survey that began with eighth graders in 1988 and followed them every two 
years through 1994, and 2) the Beginning Postsecondary Students Longitudinal Study 
(BPS:89/90). The analysis was limited to 1992 high school graduates. 

The percentages and means presented in this report were produced using the public-access 
National Education Longitudinal Study and Beginning Postsecondary Students Longitudinal 
Linkage Data File (NEB) Data Analysis System (DAS), a microcomputer application that allows 
users to specify and generate their own tables from the NEB data. The DAS produces design- 
adjusted standard errors necessary for testing the statistical significance of differences shown in 
the tables. Additional information about the DAS, and how it may be obtained, is included in ap- 
pendix B of this report. 

We hope that the information provided in this report will be useful to a wide range of inter- 
ested readers, and that the results reported here will encourage others to use the NEB data. 
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Introduction 



Research has shown that black and Hispanic students and those with low income or low 
socioeconomic status (SES) are less likely than other students to enroll in postsecondary educa- 
tion. One reason these groups do not enroll is that they are not prepared for postsecondary edu- 
cation while in high school. However, a recent study of the high school class of 1992 showed 
that if students are “college qualified” and take the steps necessary to enroll in a 4-year college, 
enrollment rates among these racial-ethnic and income groups are virtually the same.^ Nonethe- 
less, the question of what happens to these college-qualified students after they enter a post- 
secondary institution remains — mere access does not necessarily lead to completion. ^ This 
analysis uses data from the first three survey waves of the National Education Longitudinal 
Study of 1988 (NELS:88/94) and the data from the Beginning Postsecondary Students Longitu- 
dinal Study survey and follow-up (BPS:89/90) to examine this question. 

Borrowing an Analogy 

People have used several analogies to describe the paths students take through high school 
and college in the pursuit of a bachelor’s degree. The analogy of a pipeline has been used by the 
National Science Foundation to describe the pathway to science, mathematics, and engineering 
degrees.'* Others have used the pipeline analogy to describe the pathways through high school to 
entrance into a 4-year college or university.^ The analogy of a track — the persistence track — has 
been used to describe the path that many students take through college. ^ Staying “on track” in 
this context means continuing through postsecondary education until one’s educational objec- 
tives are met. 

In a recent book, William Bowen and Derek Bok used the analogy of a river to describe the 
flow of students through the educational system to a 4-year degree and beyond.”^ They argue that 



Berkner and L. Chavez, /Icceii to Postsecondary Education for the 1992 High School Graduates (NCES 98-105) (Washing- 
ton, D.C.: U.S. Department of Education, National Center for Education Statistics, 1997). 

^C. Adelman, “Turn College ‘Access* Into ‘Participation*,” Education Week October 22, 1997. 

^National Science Foundation, National Science Board, Science and Engineering Indicators — 1995 (Washington D.C.: U.S. 
Government Printing Office, 1995). 

^L. Horn and C.D. Carroll, Confronting the Odds: Students at Risk and the Pipeline to Higher Education (NCES 98-094) 
(Washington D.C.: U.S. Department of Education, National Center for Education Statistics, 1997). 

^Berkner and Chavez, Access to Postsecondary Education for the 1992 High School Graduates. 

^W. Bowen and D. Bok, The Shape of the River (Princeton, NJ: Princeton University Press, 1998). 
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the image of a pipeline brings to mind a system that is more clearly defined and smooth than it 
actually is — especially to those in the “pipeline.” The mental picture of a river, however, cap- 
tures the reality of the bends and turns in the path to a 4-year degree. Without knowledge of what 
is around the bend — ^for example, taking the SATs to get into a 4-year institution — some students 
are wrecked on “hidden shoals or snags” in the river and fail to reach their goal of a 4-year de- 
gree. Throughout this report the analogy of a river is used to describe the pathway to a 4- year 
postsecondary institution. The report starts with an analysis of the bends in the river to the point 
of entry into a 4-year institution — ^aspiring to a bachelor’s degree while in the 10th grade, being 
academically prepared for college, taking the SAT/ACT, and applying to a 4-year institution. 

The conceptualization of one juncture — being academically prepared for college — is based 
on previous work with the NELS data. In a recent report on access to 4-year colleges, Berkner 
and Chavez created a scale that captured students’ readiness for college. » According to their 
scheme, a “college-qualified” student has a minimum high school grade point average of 2.7, a 
high school class percentile rank of at least 54, a NELS: 88/94 composite test percentile score of 
at least 56, a combined SAT score of at least 820, or a composite ACT score of at least 19. Being 
“college qualified” was shown in that work to be particularly important for college access. To set 
the context for the analysis of college persistence and attainment, this report briefly reexamines 
the characteristics of those members of the high school class of 1992 who were deemed college 
qualified by their academic preparation in high school. 

The report then explores projected 4-year college persistence rates of 1992 high school 
graduates who were college qualified, using merged data from NELS:88/94 and BPS:89/90. The 
next section briefly describes this data set. 



Data 

The NELS:88/94 data set has proven to be a rich source of data on the relationship between 
high school experiences and early postsecondary outcomes. However, the last collection of 
NELS:88/94 captured the sample members when most of them were only 2 years out of high 
school — much too early to be able to say much about their eventual postsecondary attainment 
and persistence. Conversely, the BPS:89/90 survey and follow-up have been a rich source of data 
on the characteristics of beginning postsecondary students’ educational attainment and persis- 
tence but have little data about participants’ high school experiences. 

To take advantage of the information on students in both NELS and BPS, a set of variables 
was created that allows inferences to be made about the students in one data set, based on the 

*See Berkner and Chavez, Access to Postsecondary Education for the 1992 High School Graduates. 
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characteristics of students in the other data set. Each respondent in the NELS data set was 
matched with a respondent in the BPS data set based on age, sex, race-ethnicity, SES, and the 
characteristics of their first postsecondary enrollment status (e.g., full- or part-time or type of in- 
stitution). Thus, pairs of respondents in each data set were linked, creating a “synthetic cohort.” 
With the merged data from BPS, the progress of the 1992 cohort of college-qualified students 
can be inferred through the first 4 years after they first enrolled in a 4-year institution. For exam- 
ple, a black, high-SES NELS respondent who first enrolled in a public 4-year institution full time 
immediately after high school was matched with a BPS respondent with identical characteristics. 
This BPS respondent’s postsecondary outcomes were then attached to the NELS sample mem- 
ber. Appendix B of this report provides a further explanation of the statistical methods used to 
create this merged data set. 
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Navigating Through High School to College 



One of the first steps in the process of earning a 4-year degree is preparing to enter a 4-year 
institution upon graduation from high school. Such a model has been used recently with the 
NELS data to examine the path through high school to any 4-year college.^ The junctures in this 
model are as follows; 

• Having aspirations for a bachelor’s degree in the 10th grade; 

• Preparing academically for college; 

• Taking college entrance exams; 

• Applying to a 4-year college; and 

• Enrolling in a 4-year college. 

These junctures are presented in figure 1 for all 1992 high school graduates. Jumping to the 
last “bend in the river,” about 40 percent of the high school class of 1992 had taken all of the 
steps to enrollment in a 4-year institution including enrolling in a 4-year postsecondary institu- 
tion by 1994 (table 1 and figure 1). About 34 percent of high school seniors never entered “the 
river” because (for a variety of reasons) they were not striving for this level of education in the 
10th grade. 

Of those who aspired to a bachelor’s degree, the most difficult juncture was becoming aca- 
demically prepared for college (figure 1). About 10 percent of 1992 graduates were no longer on 
their way to a 4-year institution because they were not minimally qualified for college. (More 
about this later.) This is in comparison with 3 percent who then failed to take a college entrance 
exam, 6 percent who did not apply to a 4-year institution, and 6 percent who did not enroll in a 4- 
year institution (even though they had applied). 

Family income has long been known to be positively associated with educational attain- 
ment. It was also associated with successfully navigating the bends in the river to a 4-year col- 
lege or university. As the level of family income increased, high school graduates were more 
likely to have passed through each of the steps toward enrollment in a 4-year institution (table 1 
and figure 2). Moreover, moving from mere aspirations to being minimally academically 



^L. Horn and C.D. Carroll, Confronting the Odds: Students at Risk and the Pipeline to Higher Education (NCES 98-094) 
(Washington D.C.: U.S. Department of Education, National Center for Education Statistics, 1997). 
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Navigating Through High School to College 



Figure 1 — Percentage of 1992 high school graduates who prepared for enrollment in a 4-year college or 
university 



□ Did not complete all prior steps Took prior steps but not current step | | Completed this and prior steps 



10th ‘grade 
bachelor's degree 
aspirations 



Prepared academically 
(at least minimally) 



Took SAT 
and/or ACT 



Applied to 4-year 
institution 



Enrolled in 4‘year 
institution by 1994 




NOTE: Percentages may not sum to 100 due to rounding. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, National Education Longitudinal Study and 
Beginning Postsecondary Students Longitudinal Study Linkage Data File (NEB), Data Analysis System. 
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Table 1 — Percentage of 1992 high school graduates who took the steps toward 4-year college enrollment, 
by selected characteristics^ 

lOth-grade Enrolled 





bachelor’s 

degree 

aspirations 


Minimally 

prepared 

academically 


Took SAT 
and/or 
ACT 


Applied 
to a 4-year 
institution 


in a 4-year 
institution 
by 1994 


Total 


65.8 


55.5 


52.4 


46.7 


40.3 


Sex 


Male 


62.8 


51.3 


48.5 


43.3 


37.1 


Female 


68.7 


59.7 


56.4 


50.1 


43.4 


Race-ethnicity 


White, non*Hispanic 


66.9 


58.4 


55.3 


49.2 


42.5 


Black, non-Hispanic 


63.6 


45.8 


43.4 


40.8 


35.5 


Hispanic 


56.7 


40.6 


36.7 


31.5 


26.4 


Asian Pacific Islander 


72.6 


64.0 


61.7 


55.6 


47.1 


American Indian/Alaskan Native 


53.1 


38.6 


35.3 


22.6 


18.5 


Family income 


Low (less than $25,000) 


51.0 


38.4 


34.5 


29.8 


25.3 


Middle ($25,000-$74,999) 


69.3 


59.4 


56.6 


49.9 


42.5 


High ($75,000 or more) 


88.0 


83.4 


81.7 


77.7 


70.4 


Parents’ educational level 


High school or less 


46.0 


32.6 


29.3 


24.7 


20.5 


Some college 


64.1 


53.1 


49.3 


42.0 


35.3 


College graduate 


86.0 


79.3 


77.6 


.72.9 


65.2 


Number of risk factors^ 


None 


80.8 


74.7 


72.6 


65.9 


58.1 


One 


63.9 


54.5 


51.1 


44.8 


39.3 


Two 


48.9 


35.7 


30.9 


26.0 


21.2 


Three or more 


38.8 


23.0 


18.8 


16.2 


9.7 


‘The proportion of students at each step is based on those who successfully completed all the preceding steps. 

^Risk factors are low SES quartile, average grades of Cs or lower from sixth to eighth grade, changed schools two or more times 


other than normal progression, lived in a 


single-parent family, had one or more older siblings 


who dropped out of high school. 



or held back a grade before. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education longitudinal Study and the Beginning Postsecondary Student longitudinal Study (NEB), Data Analysis System. 



qualified was more difficult for graduates from low-income families than those from high- 
income families. About 13 percent of low-income graduates aspired to this level of education but 
do not become academically qualified (51 percent minus 38 percent) compared with 6 percent of 
high-income graduates. 
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Figure 2 — Percentage of 1992 high school graduates who took each step towards enrollment in a 
4-year college, by family income 




Family income 



B 1 0th-grade bachelor's degree aspirations B At least minimally prepared academically B Took SAT and/or ACT 
B Applied to 4-year institution □ Enrolled in a 4-year institution by 1 994 



SOURCE: U.S. Department of Education^ National Center for Education Statistics^ National Education Longitudinal 
Study and Beginning Postsecondaiy Students Longitudinal Study Linkage Data File (NEB), Data Analysis System. 



Characteristics of Graduates Qualified for College 

As shown above, becoming academically qualified for college was one of the largest obsta- 
cles for 1992 high school graduates. Since this was such a big step for many students, this sec- 
tion takes a closer look at which graduates were and were not college qualified. 

In their report on access to 4-year colleges, Berkner and Chavez created a scale that cap- 
tures students’ readiness for college. Judgments about a student’s “academic qualifications’’ for 
college were based on a composite of high school grade point average, high school class rank, 
1992 NELS test score performance, combined SAT score, and composite ACT score. Students 
were categorized as 1) very highly qualified, 2) highly qualified, 3) somewhat qualified, 4) 
minimally qualified, and 5) marginally or not qualified. At least minimally “college-qualified” 



'°See Berkner and Chavez, Access to Postsecondary Education for the 1992 High School Graduates. 
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students were those whose highest value on five criteria would put them into the top 75 percent 
of 4-year college students. Minimum values were a 2.7 GPA, a class rank of at least 54, a 
NELS:88 composite test percentile score of at least 56, a combined SAT score of at least 820, or 
a composite ACT score of at least 19." 

A majority (65 percent) of 1992 high school graduates were at least minimally qualified for 
college. That is, about 35 percent were marginally or not qualified (table 2). About 14 percent 
were very highly qualified — meaning that they were among the top 10 percent of all undergradu- 
ate students. 

The percentage of students qualified for college varied by sex, race-ethnicity, parental edu- 
cation level, and at-risk status. Females were more likely than males to be at least minimally 
qualified for college (68 percent compared with 61 percent). However, similar percentages of 
males and females were very highly qualified. 

White and Asian high school graduates were more likely than their black or Hispanic peers 
to be at least minimally qualified. About 68 percent of white students and 73 percent of Asian 
students were at least minimally qualified compared with 47 percent of black and 53 percent of 
Hispanic students. Furthermore, high-income graduates were more likely to be at least minimally 
college qualified than were their lower income peers, as were graduates with parents with higher 
education levels compared with those with lower education levels. Eighty-six percent of high- 
income students and 82 percent of students with a college-educated parent were at least mini- 
mally qualified, compared with 52 percent of low-income students and 47 percent of students 
whose parents did not attend college. 

White and Asian graduates, graduates from high-income families, and families with high 
educational levels were also more likely than were their peers without these characteristics to be 
very highly qualified for college (table 2). About 15 percent of white and 23 percent of Asian 
graduates were very highly qualified compared with 6 percent of blacks and 8 percent of His- 
panics. 

Students “at risk” have traditionally been defined as having those characteristics that have 
been shown in prior research to increase their risk of dropping out of high school." In the present 
study, students were classified as at risk in the eighth grade if they 1) came from low SES fami- 
lies, 2) had a C average or lower in the sixth to eighth grade, 3) changed schools two or more 



* *For details of how this variable was created, see appendix A. 

"P. Kaufman and D. Bradby, Characteristics of At-Risk Students in NELS:88 (NCES 92-042) (Washington, D.C.: U.S. Depart- 
ment of Education, National Center for Education Statistics, 1992). 
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Table 2 — Percentage distribution of 1992 high school graduates according to qualiHcation to attend a 4-year 
college, by selected student characteristics 



Qualification to attend a 4“year college 





Marginally/ 
not qualified 


Minimally 

qualified 


Somewhat 

qualified 


Highly 

qualified 


Very highly 
qualified 


Total 


35.5 


16.6 


15.9 


18.2 


13.8 


Sex 


Male 


39.1 


16.7 


14.5 


16.4 


13.3 


Female 


31.9 


16.5 


17.3 


20.0 


14.4 


Race-ethnicity 


White, non-Hispanic 


31.9 


16.1 


16.6 


20.3 


15.2 


Black, non-Hispanic 


53.1 


16.7 


14.0 


9.9 


6.3 


Hispanic 


47.0 


20.7 


13.6 


10.8 


7.9 


Asian/Pacific Islander 


27.3 


14.6 


15.0 


20.2 


23.0 


American Indian/ Alaskan Native 


55.2 


22.2 


15.8 


5.9 


1.0 


Family income 


Low (less than $25,000) 


47.5 


18.7 


12.8 


13.6 


7.3 


Middle ($25,000-$74,999) 


32.4 


16.1 


17.0 


19.9 


14.6 


High ($75,000 or more) 


14.1 


11.5 


18.4 


27.0 


29.0 


Parents’ educational level 


High school or less 


52.9 


19.0 


11.4 


10.4 


6.3 


Some college 


35.7 


18.1 


17.8 


17.6 


10.8 


College graduate 


18.1 


11.9 


17.7 


27.0 


25.4 


Number of risk factors* 


None 


18.6 


13.2 


19.5 


26.5 


22.2 


One 


33.6 


20.1 


16.2 


17.4 


12.7 


Two 


53.5 


18.7 


12.8 


10.4 


4.6 


Three or more 


66.0 


18.8 


8.4 


4.2 


2.6 



♦Risk factors are low SES quartile, average grades of Cs or lower from sixth to eighth grade, changed schools two or more 
times other than normal progression, lived in a single-parent family, had one or more older siblings who dropped out of high 
school, or held back a grade before. 



NOTE: Percentages may not sum to 100 due to rounding. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 



times other than normal progression, 4) lived in single-parent families, 5) had one or more sib- 
lings who dropped out of high school, or 6) had been held back a grade. 



*^For example, normal progression from middle school to high school. 
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Graduates with no risk factors were more likely to be at least minimally qualified than were 
their peers with one or more risk factors. About 34 percent of those with three or more risk fac- 
tors were at least minimally qualified compared with 81 percent of those with no risk factors. 
Similarly, 22 percent of those with no risk factors were highly qualified for college, compared 
with 3 percent of those with three or more risk factors. 

Enrollment in a 4- Year Institution 

Overall, about 45 percent of the whole high school class of 1992 enrolled in a 4-year insti- 
tution by 1994 (figure 3).*'* However, there were differences in enrollment rates according to stu- 
dent characteristics. For example: 

• Hispanics were less likely than whites to enroll in a 4-year institution; 

• Low-income students were less likely to enroll than higher income students; 

• Students with more highly educated parents were more likely than others to enroll; and 

• Students with fewer risk factors enrolled at higher rates than those with more risk fac- 
tors. 

However, when one looks just at those 1992 graduates who were college qualified and took 
the necessary steps to enroll — those who took college entrance exams and applied to a 4-year in- 
stitution — many of these differences diminish or disappear. For example, among college- 
qualified students who took the steps to enroll, 77 percent of Hispanics enrolled in a 4-year 
school compared with a statistically indistinguishable 82 percent of black students and 84 per- 
cent of white students. Furthermore, college-qualified students with one or two risk factors en- 
rolled at similar rates as those with no risk factors — 87 percent for those with none compared 
with 83 percent with one and 79 percent with two risk factors. 

While, among college-qualified students, those with college-educated parents still enrolled 
at higher rates than those with parents without a college degree, these differences were less 
striking than the differences shown above among all graduates. For example, about 26 percent of 
all graduates with parents with a high school education enrolled in a 4-year college, compared 
with 71 percent of those with parents who had a college degree — a 45 percentage-point differ- 
ence. Among college-qualified graduates, this difference was 14 percentage points — 76 percent 
versus 90 percent. 



*‘*This percentage is higher than that shown in figure 1 since some proportion of students enrolled in a 4-year institution without 
navigating all of the junctures in the process. 
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Figure 3 — Percentage of 1992 high school graduates who enrolled in a 4-year institution in the 1992-93 academic year 
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The previous sections indicated that among students who were “college qualified” and took 
the steps necessary to enroll in a 4-year college, enrollment rates among racial-ethnic and in- 
come groups were virtually the same. Nonetheless, the question of what happens to these col- 
lege-qualified students after they enter a postsecondary institution remains — mere access does 
not necessarily lead to completion. 

Using the combined NELS/BPS data, it was possible to project what would have happened 
to these college-qualified 1992 graduates after they entered a 4-year college based on the experi- 
ences of the BPS cohort. Figure 4 projects postsecondary outcomes for a standard cohort of 
1,000 college-qualified 1992 high school graduates who entered a 4-year institution in the 1992- 
93 academic year. The figure illustrates the projected flow of the high school class of 1992 
through the first four years after graduation. For example, of 1,000 1992 high school graduates 
who entered a 4-year institution in the 1992-93 academic year, 832 continued to be enrolled in 
the 4-year sector in the 1993-94 academic year. That is, 83 percent stayed “on the river” between 
their first and second academic years. 

For simplicity of presentation, the findings in this section are discussed as if outcomes of 
the sample of 1992 high school graduates were actually observed rather than inferred. However, 
the reader should keep in mind the findings are based on projected data. 

Persistence 

There are a variety of ways to measure persistence in 4-year colleges and universities. In 
this report persistence is defined as continued enrollment in the 4-year sector — regardless of 
whether the student was continuously enrolled in the same institution. Measured in this manner, 
the projected persistence rates for 1992 high school graduates who continued in the 4-year sector 
from the previous year were quite high. In the first year, 83 percent stayed, 91 percent of those 
persisted into their second year, and 80 percent of those who persisted into their second year per- 
sisted into their third year (figure 4). Five years after high school graduation, 307 of the original 
1,000 college-qualified students (31 percent of the total) were still enrolled in a 4-year institution 
and had not yet earned a bachelor’s degree. 

'^See L. Beikner, A. McCormick, and S. Cuccaro-Alamin, Descriptive Summary of 1989-90 Beginning Postsecondary Students: 
5 Years Z/j/er (Washington D.C: U.S. Department of Education, National Center for Education Statistics, 1996). 
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Figure 4 — ^Projected persistence, degree attainment, and nondegree departure among 1992 high school 
graduates enrolled in a 4-year institution in the 1992-93 academic year 



Academic 

year 



Nondegree 

departure 



Persistence 



Degree 

attainment 




1,001 

661 



NOTE: Percentages may not sum to 100 due to rounding. 

SOURCE: U.S, Department of Education, National Center for Education Statistics, National Education Longitudinal Study and 
Beginning Postsecondary Students Longitudinal Study Linkage Data File (NEB), Data Analysis System, 



o 

ERIC 



best copy available 



27 



14 



Projected Overall Persistence and Degree Attainment 



Nondegree Departure 

Students can stop attending college in a variety of ways. One way is to drop out of the 4- 
year sector without attaining any degree. About 17 percent of the 1992 graduates were projected 
to have dropped out of the 4-year sector after their first year. Out of the original 1,000 students, 
86 transferred to a less-than-4-year institution, 41 stopped out, and 42 left and never returned to 
postsecondary education (figure 4). 

By the end of the second year, 9 percent of those who were enrolled the previous year 
dropped out of the 4-year sector — 1 percent of the total transferred downward, 4 percent stopped 
out, and 2 percent left and did not return. (Out of the original 1,000 students, 12 transferred 
downward, 38 stopped out, and 21 left and did not return.) 

By the end of the third year, 16 percent of those who were enrolled the previous year 
dropped out — 5 percent of the initial group transferred downward, 4 percent stopped out, and 3 
percent left and did not return. After the fourth year, 8 percent of those who were enrolled the 
previous year dropped out — 1 percent of the original 1,000 students transferred downward, 1 
percent stopped out, and 2 percent left and did not return. 

After four years, 120 of the original 1,000 students (12 percent of the total) were not en- 
rolled in postsecondary education. About 133 students (13 percent of the original total) were en- 
rolled, but not in a 4- year institution, and 96 students (10 percent of the original total) were 
reenrolled in a 4-year institution. 

Degree Attainment 

Another reason students stop attending a 4-year institution is that they achieve their degree 
objectives. After 4 years, a projected 29 percent of the high school class of 1992 who had entered 
a 4-year institution in the 1992-93 academic year received a bachelor’s degree. About 26 percent 
had received their degree through continual enrollment in the 4-year sector, while another 3 per- 
cent received a bachelor’s degree after leaving the 4-year sector but then returning (figure 4). 

As expected, most of the predicted bachelor’s degrees were earned in the fourth year — 84 
percent of the predicted degrees earned by all of the original cohort after 4 years were earned in 
the last year by students who had been continuously enrolled. 

Some college-qualified students may have left the 4-year sector after attaining a certificate 
or associate’s degree. About 4 percent of the 1992 high school graduates were projected to have 
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received an associate’s degree after 4 years, while 2 percent received some type of postsecondary 
certificate. 



Differences Between Non-Hispanic White and Black Students 

Figures 5 and 6 show the flow through their first four years of college separately for black 
and white students. While the sample size for black students is relatively small and the 



Figure 5 — Projected persistence, degree attainment, and nondegree departure among black non-Hispanic 
1992 high school graduates enrolled in a 4-year institution in the 1992-93 academic year 



Academic Nondegree Degree 

departure Persistence attainment 




NOTE: Percentages may not sum to 100 due to rounding. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, National Education Longitudinal Study and 
Beginning Postsecondary Students Longitudinal Study Linkage Data File (NEB), Data Analysis System. 
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corresponding standard errors are large, several differences between white and black students are 
statistically significant. In particular, after four years, black students were more likely than white 
students to be not enrolled without any degree (18 percent for blacks compared with 9 percent 
for whites). After four years black students were less likely to have completed a bachelor’s de- 
gree (14 percent for black students compared with 29 percent of white students). 



Figure 6 — Projected persistence, degree attainment, and nondegree departure among white non-Hispanic 
1992 high school graduates enrolled in a 4-year institution in the 1992-93 academic year 



Academic 



Nondegree 

_dc£arture_ 



Pcrisistcncc 



Degree 

aiiainmcnt 




1.000 

525 



NOTE: Percentages may not sum to 100 due to rounding. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, National Education Longitudinal Study and 
Beginning Postsecondary Students Longitudinal Study Linkage Data File (NEB), Data Analysis System. 




BESTCOPYAVAILABLE 

17 



30 



Projected Overall Persistence and Degree Attainment 



Individual Characteristics Associated With Attainment and Persistence 
College-Qualified Students 

While the overall persistence and attainment rates shown above are of interest in them- 
selves, the main purpose of this report is to examine differences in persistence and attainment 
rates for college-qualified students with different background characteristics. 

After 4 years, females were more likely than males to still be enrolled or have a de- 
gree — 12 percent of males (100 percent minus 28 percent with no degree and not enrolled) com- 
pared with 77 percent of females (table 3). Females were also more likely to have earned their 
bachelor’s degree. About 37 percent of females had earned their bachelor’s degree after 4 years, 
compared with 22 percent of males. 

Black students were more likely than white students to be not enrolled and not have earned 
a degree after 4 years. About 41 percent of black students who had been college qualified and 
had enrolled in a 4-year institution in their first year out of high school were estimated to not be 
enrolled and to have not earned a degree or certificate 4 years later. Black students were also less 
likely than white students to have earned a bachelor’s degree 4 years after entering a 4-year in- 
stitution. Approximately 13 percent of blacks were estimated to have earned a bachelor’s degree 
compared with 32 percent of whites.'^ 

Family income was generally not associated with either persistence or degree attainment. 
However, college-qualified students who had a parent with a college education were more likely 
to persist and were more likely to receive their bachelor’s degree than were students whose par- 
ents had no college experience. 

High school graduates who were more highly qualified for college before entering a 4-year 
college were more successful once they got there. After 4 years, higher qualified students were 
more likely to attain a degree or certificate of some kind and were more likely to have earned a 
bachelor’s degree (table 3). Five years after first enrolling in a 4-year institution, lower qualified 
graduates were more likely to no longer be enrolled without having earned any kind of degree or 
certificate (table 3). 



*^The apparent differences between black students and Hispanic and Asian students were not statistically significant. 
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Table 3— Percentage distribution of college-qualified’ 1992 high school graduates who enrolled in a 4-year 
college, according to projected persistence and attainment 4 years after high school graduation, 
by selected student characteristics 





Projected persistence and attainment 
4 years after high school graduation 


4 


Projected highest degree attained 
years after high school graduation 




Attained 
a degree 


No degree, 
enrolled 


No degree, 
not enrolled 


None 


Certificate 


Associate’s 

degree 


Bachelor’s 

degree 


Total 


36.0 


38.7 


25.3 


64.0 


2.2 


3.9 


29.9 


Sex 


Male 


27.6 


44.3 


28.1 


72.4 


1.5 


3.8 


22.3 


Female 


43.4 


33.8 


22.9 


56.6 


2.8 


4.0 


36.6 


Race-ethnicity 


White, non-Hi spanic 


38.3 


38.9 


22.8 


61.7 


2.0 


4.5 


31.8 


Black, non-Hispanic 


19.9 


39.1 


41.0 


80.1 


5.1 


2.0 


12.8 


Hispanic 


27.6 


44.3 


28.1 


72.4 


2.3 


0 


25.3 


Asian/Pacific Islander 


34.9 


32.7 


32.4 


65.1 


0.7 


3.5 


30.7 


American Indian/ Alaskan Native 


— 


— 


— 


— 


— 


— 


— 


Family income 


Low (less than $25,000) 


33.0 


37.6 


29.4 


67.0 


3.1 


2.6 


27.3 


Middle ($25,000-$74,999) 


34.4 


39.6 


26.0 


65.6 


2.3 


5.2 


26.9 


High ($75,000 or more) 


42.6 


36.6 


20.9 


57.4 


1.8 


3.2 


37.6 


Parents’ educational level 


High school or less 


29.6 


38.7 


31.7 


70.4 


3.7 


2.1 


23.9 


Some college 


32.3 


39.9 


27.8 


67.7 


2.6 


3.4 


26.3 


College graduate 


40.2 


37.8 


21.9 


59.8 


1.5 


5.0 


33.7 


College qualification 


Minimally qualified 


30.9 


36.5 


32.7 


69.1 


4.0 


2.2 


24.7 


Somewhat qualified 


32.1 


39.2 


28.7 


67.9 


2.3 


4.4 


25.5 


Highly qualified 


36.8 


39.6 


23.6 


63.2 


1.8 


5.4 


29.6 


Very highly qualified 


40.1 


38.5 


21.4 


59.9 


1.6 


2.9 


35.6 


Number of risk factors^ 


None 


36.7 


38.9 


24.5 


63.4 


1.7 


3.6 


31.3 


One 


35.5 


38.0 


26.5 


64.5 


2.6 


3.7 


29.2 


Two 


32.7 


40.2 


27.1 


67.3 


4.1 


5.4 


23.2 


Three or more 


26.1 


43.1 


30.8 


73.9 


3.5 


0 


22.6 



— Sample size too small for a reliable estimate. 

^Minimally qualified for attending a 4-year institution. 

^Risk factors are low SES quartile, average grades of Cs or lower from sixth to eighth grade, changed schools two or more times 
other than normal progression, lived in a single-parent family, had one or more older siblings who dropped out of high school, 
or held back a grade before. 

NOTE: Percentages may not sum to 100 due to rounding. 

SOURCE; U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 
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Eighth-grade risk factors had no notable association with either persistence or degree at- 
tainment. About the same proportion of college-qualified students persisted and received their 
bachelor’s degree regardless of how many risk factors they had. 

Highly College-Qualified Students and Minimally Qualified Students 

The findings presented so far considered all students who were at least minimally qualified 
for college. That is, they had either a minimum of a 2.7 high school GPA, a class percentile rank 
of at least 54, a NELS test composite of at least 56, a combined SAT score of at least 820, or a 
composite ACT score of at least 19. A wide range of ability and knowledge is included within 
this group of at least minimally college-qualified students, from students who were barely quali- 
fied for a 4-year college (e.g., they had an SAT of 820) to those who were very highly qualified 
(e.g., they had an SAT of 1530). Differences found in the previous section between groups de- 
fined by their parental education and race-ethnicity may be due to variability among these 
groups in their qualifications for college. That is, minimally qualified students may have parents 
with lower levels of education than those of highly qualified students. This section attempts to 
control for some of these differences by separating those students who were only minimally or 
somewhat qualified from those who were highly or very highly qualified for college. 

Looking at just those students who were minimally or somewhat qualified for college, 
family income did not seem to be associated with either persistence or degree attainment in this 
group (table 4). However, white students who were only minimally or somewhat college quali- 
fied were much more likely than black and Hispanic students to have earned a bachelor’s degree 
after 4 years — ^30 percent of whites compared with 12 and 13 percent of blacks and Hispanics, 
respectively (table 4). 

Looking at just those students who were highly or very highly college qualified revealed 
somewhat different patterns (table 5). Highly or very highly qualified students were those stu- 
dents whose highest value on any of the five criteria put them in the top 25 percent of 4-year 
college students. Minimum values were 1) a minimum high school GPA of 3.6 or higher; 2) a 
class rank percentile of at least 89; 3) a NELS composite test percentile of at least 90; and 4) a 
combined SAT of at least 1 100 or a composite ACT of at least 25. 

As with those minimally or somewhat qualified, there was no significant differences in the 
proportion of low-, middle-, and high-income students who were not enrolled and had no degree 
after 4 years. However, middle-income students within this group were less likely than 



*^The figure of 1530 for the highest SAT score is taken from the variable description in the NELS/BPS DAS. 
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high-income students to have earned a bachelor’s degree after 4 years — 29 percent compared 
with 40 percent, respectively. 



Table 4 — ^Percentage distribution of minimally or somewhat college-qualified 1992 high school graduates 
who enrolled in a 4-year college, according to projected persistence and attainment 4 years 
after high school graduation, by selected student characteristics 



Projected persistence and attainment Projected highest degree attained 

4 years after high school graduation 4 years after high school graduation 





Attained 
a degree 


No degree, 
enrolled 


No degree, 
not enrolled 


None 


Certificate 


Associate’s 

degree 


Bachelor’s 

degree 


Total 


31.7 


38.1 


30.2 


68.4 


3.0 


3.5 


25.2 


Sex 


Male 


23.5 


41.0 


35.6 


76.5 


1.9 


2.2 


19.4 


Female 


38.7 


35.7 


25.7 


61.3 


3.9 


4.7 


30.2 


Race-ethnicity 
White, non-Hispanic 


36.3 


36.3 


27.4 


63.7 


2.5 


4.2 


29.5 


Black, non-Hispanic 


21.9 


39.6 


38.5 


78.1 


7.2 


2.5 


12.2 


Hispanic 


15.1 


64.2 


20.6 


84.9 


2.7 


0 


12.5 


Asian/Pacific Islander 


18.0 


29.8 


52.3 


82.0 


0 


1.1 


16.9 


American Indian/Alaskan Native 


— 


— 


— 


— 


— 


— 


— 


Family income 


Low (less than $25,000) 


27.5 


37.3 


35.2 


72.5 


5.2 


1.9 


20.3 


Middle ($25,000-$74,999) 


32.0 


39.1 


28.9 


68.0 


3.2 


5.1 


23.7 


High ($75,000 or more) 


36.1 


36.8 


27.1 


63.9 


1.3 


3.2 


31.7 


Parents’ educational level 


High school or less 


24.7 


36.3 


39.0 


75.3 


4.2 


0.6 


19.9 


Some college 


28.6 


39.8 


31.6 


71.4 


4.0 


4.2 


20.4 


College graduate 


38.7 


37.3 


24.1 


61.3 


1.5 


4.7 


32.5 


Number of risk factors* 


None 


31.7 


35.8 


32.5 


68.3 


2.9 


2.3 


26.5 


One 


33.4 


42.3 


24.3 


66.6 


2.4 


4.5 


26.5 


Two 


27.9 


34.4 


37.7 


72.1 


5.1 


5.3 


17.5 


Three or more 


19.3 


44.8 


35.9 


80.7 


5.7 


0 


13.6 



— Sample size too small for a reliable estimate. 

♦Risk factors are low SES quartile, average grades of Cs or lower from sixth to eighth grade, changed schools two or more 
times other than normal progression, lived in a single-parent family, had one or more older siblings who dropped out of high 
school, or held back a grade before. 

NOTE: Percentages may not sum to 100 due to rounding. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 
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Table 5 — Percentage distribution of highly or very highly college-qualified 1992 high school graduates who 
enrolled in a 4-year college, according to projected persistence and attainment 4 years after high 
school graduation, by selected student characteristics 

Projected persistence and attainment Projected highest degree attained 

4 years after high school graduation 4 years after high school graduation 

Attained No degree. No degree. Associate’s Bachelor’s 





a degree 


enrolled 


not enrolled 


None 


Certificate 


degree 


degree 


Total 


38.4 


39.1 


22.6 


61.6 


1.7 


4.2 


32.5 


Sex 


Male 


29.9 


46.1 


24.0 


70.1 


1.2 


4.7 


23.9 


Female 


46.1 


32.7 


21.3 


54.0 


2.2 


3.7 


40.2 


Race-ethnicity 


White, non-Hispanic 


39.3 


40.2 


20.6 


60.7 


1.8 


4.6 


32.9 


Black, non-Hispanic 


16.4 


38.1 


45.5 


83.7 


1.4 


1.1 


13.8 


Hispanic 


35.6 


31.4 


32.9 


64.4 


2.0 


0 


33.6 


Asian/Pacific Islander 


43.2 


34.2 


22.7 


56.8 


1.0 


4.7 


37.4 


American Indian/Alaskan Native 


— 


— 


— 


— 


— 


— 


— 


Family income 


Low (less than $25,000) 


37.1 


37.8 


25.2 


63.0 


1.5 


3.1 


32.4 


Middle ($25,000-374,999) 


35.7 


39.9 


24.4 


64.3 


1.9 


5.2 


28.6 


High ($75,000 or more) 


45.2 


36.5 


18.3 


54.8 


2.1 


3.2 


40.0 


Parents’ educational level 


High school or less 


34.4 


41.0 


24.5 


65.6 


3.2 


3.5 


27.7 


Some college 


34.8 


40.0 


25.2 


65.2 


1.7 


2.9 


30.2 


College graduate 


40.9 


38.1 


21.1 


59.1 


1.6 


5.1 


34.2 


Number of risk factors* 


None 


38.8 


40.2 


21.0 


61.2 


1.3 


4.1 


33.4 


One 


36.9 


35.1 


28.0 


63.1 


2.8 


3.2 


30.9 


Two 


37.6 


46.2 


16.3 


62.4 


3.1 


5.5 


29.0 


Three or more 


36.6 


40.5 


23.0 


63.4 


0 


0 


36.5 



— Sample size too small for a reliable estimate. 

*Risk factors are low SES quartile, average grades of Cs or lower from sixth to eighth grade, changed schools two or more 
times other than normal progression, lived in a single-parent family, had one or more older siblings who dropped out of high 
school, or held back a grade before. 

NOTE: Percentages may not sum to 100 due to rounding. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 



The pattern among racial-ethnic groups was also somewhat different for just those students 
who had been highly or very highly qualified compared with the other groups of students exam- 
ined above (table 5). While white students who were only minimally or somewhat college quali- 
fied were more likely than Hispanic students to have earned a bachelor’s degree after 4 years, 
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these differences did not exist among those students who were highly or very highly qualified. 
About 33 percent of white highly or very highly qualified students had earned bachelor’s degrees 
compared with 34 percent of Hispanics. 

However, black students who were highly qualified or very highly qualified were less 
likely than whites or Hispanics to have earned a bachelor’s degree (or any degree or certificate) 4 
years after enrolling in a 4-year college. About 14 percent of blacks were projected to have 
earned a bachelor’s degree. Four years after first enrolling in a 4-year college, a projected 46 
percent of black highly or very highly qualified students were no longer enrolled in college and 
had not earned any kind of degree or certificate. 

Student Debt^* 

About half of all undergraduate students in the United States receive student financial aid 
to support their education. Who borrows and the amount they borrow is heavily dependent on the 
student’s enrollment status (full- or part-time), their dependency status (whether they still live 
with their parents), and the type of institution they attend (e.g., 4-year private, 2-year public). 
Using the merged NELS/BPS data set, one can project who among the high school class of 1992 
received financial aid in their first four years of undergraduate education and the amount that 
they borrowed. About half of all 1992 high school graduates who enrolled in a 4-year institution 
borrowed for their education (table 6).'^ The proportion that borrowed varied little regardless of 
whether they finished their education or not. About 51 percent of those who attained a degree 
after 5 years had borrowed, 57 percent of those who had not received a degree but were still en- 
rolled had borrowed, and about 46 percent of those who were no longer enrolled and had no de- 
gree had borrowed. The percentage that received a loan also did not vary greatly by college 
qualifications — approximately the same proportion borrowed whether they were minimally 
qualified or very highly qualified.^o 

The picture changes somewhat when one looks at the amount that students borrowed for 
their education (table 7). Four years after high school graduation, those who were projected to 
have no degree and not be enrolled had borrowed less than those who attained a degree — $5,800 
compared with $1 1,200. This may be due to a number of factors, including the type of institution 
in which the students enrolled, their full-time or part-time status, or their dependency status. 



*^The BPS:89/90 collected data on total amount borrowed during the first four years of college but did not attempt to collect data 
on the amount of grants that students received. Therefore this section examines only loans rather than loans and grants. 

^^This includes borrowing from all sources, federal, state, institutional, and personal. 

^^ispanic students who earned a degree appear to have borrowed at higher rates than students from other racial-ethnic groups. 
However, this difference is not statistically significant at conventional levels. 




23 



36 



Projected Overall Persistence and Degree Attainment 



Table 6— Percentage of college-qualified' 1992 high school graduates who borrowed for their education 
according to projected persistence and attainment 4 years after high school graduation, 
by selected student characteristics 





Projected persistence and attainment 
4 years after high school graduation 


Projected highest degree attained 
4 years after high school graduation 




Attained 
a degree 


No degree, 
enrolled 


No degree, 
not enrolled 


None 


Associate’s 

degree 


Bachelor’s 

degree 


Total 


51.1 


56.5 


45.6 


52.4 


58.5 


50.0 


Sex 

Male 


54.9 


58.3 


45.9 


53.7 


69.9 


51.8 


Female 


48.9 


54.7 


45.2 


51.2 


46.2 


49.0 


Race-ethnicity 


White, non-Hispanic 


49.8 


54.5 


46.3 


51.6 


59.5 


48.7 


Black, non-Hispanic 


52.7 


66.2 


43.3 


56.1 


— 


47.3 


Hispanic 


73.3 


65.9 


43.5 


57.7 





71.6 


Asian/Pacific Islander 


53.4 


58.8 


40.0 


49.7 





53.5 


American Indian/Alaskan Native 


— 


— 


— 


— 


— 


— 


College qualification 


Minimally qualified 


49.5 


55.1 


41.5 


49.1 





49.0 


Somewhat qualified 


48.4 


59.0 


53.8 


57.0 





49.9 


Highly qualified 


49.5 


62.4 


41.9 


54.9 


65.7 


48.0 


Very highly qualified 


54.7 


48.5 


45.0 


47.3 


70.7 


52.2 



— Sample size too small for a reliable estimate. 

'Minimally qualified for attending a 4-year institution and enrolled in a 4-year institution. 



SOURCE: U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 



While a full understanding of the association of degree attainment and borrowing is beyond 
the scope of this report, it is informative to look at the amount of debt burden that accrued to stu- 
dents who left the 4-year sector without receiving any type of degree. In particular, highly quali- 
fied students ended their initial college career with a larger debt burden that did those less 
qualified. Of those minimally qualified students who entered a 4-year institution, about 42 per- 
cent had taken out a loan and had borrowed an average of about $3,600. Of the 21 percent of 
highly qualified students who had no degree and were not enrolled (table 3), about 45 percent 
had borrowed for their education and had borrowed about $8,000.2 1 



2* NEB Data Analysis System. 
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Table 7 — Average amount borrowed among college-qualined* 1992 high school graduates who borrowed 
for their education, according to projected persistence and attainment 4 years after high school 
graduation, by selected student characteristics 





Projected persistence and attainment 
4 years after high school graduation 


Projected highest degree attained 
4 years after high school graduation 




Attained 
a degree 


No degree, 
enrolled 


No degree, 
not enrolled 


None 


Associate’s 

degree 


Bachelor’s 

degree 


Total 


$11,229 


$9,321 


$5,797 


$8,186 


$6,931 


$12,227 


Sex 


Male 


11,349 


9,100 


5,035 


7,816 


— 


13,196 


Female 


11,155 


9,557 


6,565 


8,574 


— 


11,693 


Race-ethnicity 


White, non-Hispanic 


11,394 


9,200 


6,042 


8,214 


7,320 


12,366 


Black, non-Hispanic 


6,646 


9,480 


5,473 


8,118 


— 


7,760 


Hispanic 


11,325 


10,390 


4,244 


8,701 


— 


11,778 


Asian/Pacific Islander 


11,937 


9,215 


5,537 


7,782 


— 


12,842 


American Indian/Alaskan Native 


— 


— 


— 


— 


— 


— 


College qualification 


Minimally qualified 


11,739 


9,379 


3,599 


7,232 


— 


12,534 


Somewhat qualified 


10,488 


10,132 


5,165 


8,326 


— 


11,615 


Highly qualified 


10,733 


9,519 


5,728 


8,455 


— 


11,965 


Very highly qualified 


11,904 


8,251 


7,996 


8,171 


— 


12,659 



— Sample size too small for a reliable estimate. 

^Minimally qualified for attending a 4-year institution and enrolled in a 4-year institution. 



SOURCE: U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 



Summary of Tabular Results 

The original question posed by this analysis was what happens to college-qualified students 
once they enter a 4-year institution. Research by Berkner and Chavez had shown that the well- 
known differences in college enrollment rates for different groups of students were attenuated 
when one considered only those students who were qualified for college. The initial portion of 
this analysis summarized these findings by showing that for college-qualified students there were 
few differences in 4-year enrollment among students from different racial-ethnic backgrounds or 
from different families with different income levels. College-qualified students whose parents 
had a college degree enrolled at rates higher than other students, but the difference was less for 
college-qualified students than for students as a whole. 
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Furthermore, in analyzing the outcomes of college-qualified students after they enrolled in 
a 4-year college or university, there appeared to be few differences associated with two factors 
that are usually correlated with educational outcomes. That is, family income and prior academic 
risk factors did not seem to be variables associated with either persistence or attainment. 

However, among all college-qualified students, those with college-educated parents were 
more likely than other college-qualified students to persist and/or attain a bachelor’s degree in 4 
years. This makes some intuitive sense if one again imagines the path to a 4-year degree as a 
river with many bends and twists to it. College-educated parents have already navigated this 
same river and may be able to help their children see beyond the next bend and keep them on 
course. Alternatively, students with college-educated parents may have other resources at their 
command that other students do not — simple parental expectation and encouragement may be 
one of them. 

A less intuitive finding was that all college-qualified black students (and even highly quali- 
fied black students) were less likely than white peers after 4 years to have received a college de- 
gree and were more likely to no longer be enrolled. However, in analyzing these factors 
associated with persistence and attainment of college-qualified students, the analysis so far 
looked at each variable or factor individually. Many of the variables presented above are related 
to one another. For example, parents’ education is associated with race-ethnicity. Therefore the 
differences in the persistence and attainment rates between highly college-qualified white and 
black students shown above may be due to differences in the parental education of black and 
white students. White students may have been more likely to have college-educated parents than 
did black students and, as speculated above, this may be to the advantage of white students in 
their pursuit of a 4-year degree. 

Therefore, to isolate some of these associations, a multivariate analysis was conducted on 
these data. The results of this analysis are presented in the next section. 



Controlling for Related Variables 



In this section linear regression models are used to determine the net individual influence 
of each variable examined in the descriptive analysis on the persistence and attainment rates. 
(See appendix B for details on the multivariate methods used here.) The analysis looked at just 
two outcomes: 1) nonpersistence — defined as not having any degree and not being enrolled 4 
years after high school, and 2) bachelor’s degree attainment 4 years after graduating from high 
school. The sample of students was further restricted to those who were highly or very highly 
college qualified. 

The results of this analysis of attaining a bachelor’s degree are shown in table 8. The first 
column shows the unadjusted percentages of at least highly qualified students who had received 
a bachelor’s degree 4 years after high school graduation. These percentages match the percent- 
ages originally shown in column 7 of table 5 above. The second column presents the adjusted 
percentages, for which the variation of all the other variables in the table has been controlled. 
The italicized row is the comparison group within a row variable. The adjusted percentages dif- 
fer very little from the unadjusted percentages shown in table 5. Even after controlling for sex, 
parents’ education, parents’ income, and number of risk factors present, highly qualified black 
students were projected to be less likely than white students to attain a bachelor’s degree. 

Table 9 shows the results from the adjustment of the percentages of column 3 in table 5 
above — the percentage of 1992 graduates who, after 4 years, were projected to be no longer en- 
rolled and to have not received any type of degree. The adjusted percentages in table 9 also differ 
very little from the unadjusted percentages. After controlling for other variables in the table, 
blacks were also more likely than whites to be out of college and without a degree 4 years after 
high school graduation. 

However, even within this group of high and very highly qualified students there was a 
high degree of variation in “qualifications.” These differences may also be the source of the ob- 
served differences between black and white projected attainment and persistence rates. To be 
classified as highly or very highly college qualified, students had only to meet one of five crite- 
ria. These criteria were 1) a minimum high school GPA of 3.6 or higher; 2) a class rank percen- 
tile of at least 89; 3) a NELS composite test percentile of at least 90; or 4) a combined SAT of at 
least 1100 or a composite ACT of at least 25. Therefore, a highly qualified student could have 
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Table 8— Percentage of highly or very highly college-qualifled 1992 high school graduates who 

enrolled in a 4-year college, projected to complete the bachelor’s degree, and the adjusted 
percentage after taking into account the covariation of the variables listed in the table' 

Unadjusted Adjusted OLS Standard 

percentage^ percentage^ coefficient'* error^ 



Total 


32.5 


32.6 


22.48 


5.41 


Sex 


Male 


23.9 


23.4 


f 


f 


Female 


40.2 * 


40.7 * 


17.30 


2.65 


Race-ethnicity 


White, non-Hispanic 


32.9 


33.0 


t 


t 


Black, non-Hispanic 


13.8 * 


13.9 * 


-19.13 


6.82 


Hispanic 


33.6 


38.7 


5.76 


6.10 


Asian/Pacific Islander 


37.4 


35.9 


2.92 


5.13 


Family income 


Low (less than $25,000) 


32.4 


34.4 


t 


t 


Middle ($25,000-$74,999) 


28.6 


28.8 


-5.62 


4.03 


High ($75,000 or more) 


40.0 


40.0 


5.64 


4.79 


Parents’ educational level 


High school or less 


27.7 


29.1 


t 


f 


Some college 


30.2 


32.2 


3.07 


4.80 


College graduate 


34.2 


33.6 


4.46 


4.69 


Number of risk factors** 


None 


33.4 


33.1 


t 


f 


One 


30.9 


31.0 


-2.09 


3.04 


Two 


29.0 


33.7 


0.53 


6.02 


Three or more 


36.5 


42.3 


9.22 


13.07 



*p < .05. 

♦♦Risk factors are low SES quartile, average grades of Cs or lower from sixth to eighth grade, changed schools two or more 
times other than normal progression, lived in a single-parent family, had one or more older siblings who dropped out of high 
school, or held back a grade before. 

tNot applicable for the reference group. 

The italicized group in each category is the reference group being compared, 
hlie estimates are from the NEB Data Analysis System. 

^e percentages are adjusted for differences associated with other variables in the table (see appendix B). 

The percentages are also adjusted for differences associated with 1992 SAT total score. 

Ordinary least squares (OLS) coefficient, multiplied by 100 to reflect percentage (see appendix B). 

Standard error of OLS coefficient, adjusted for design effect, multiplied by 100 to reflect percentage (see appendix B). 

SOURCE: U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 
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Table 9 — Percentage of highly or very highly college-qualined 1992 high school graduates who enrolled 
in a 4-year college, projected to complete no degree and not be enrolled, and the adjusted 
percentage after taking into account the covariation of the variables listed in the table^ 

Unadjusted Adjusted OLS Standard 

percentage^ percentage^ coefficient"* error^ 



Total 


22.6 


22.7 


22.45 


5.36 


Sex 


Male 


24.0 


24.8 


t 


t 


Female 


21.3 


20.9 * 


-3.83 


2.63 


Race-ethnicity 


White, non-Hispanic 


20.6 


20.6 


t 


t 


Black, non-Hispanic 


45.5 * 


46.5 * 


25.81 


6.76 


Hispanic 


32.9 


32.4 * 


11.72 


6.05 


Asian/Pacific Islander 


22.1 


23.6 * 


2.97 


5.09 


Family income 


Low (less than $25,000) 


25.2 


22.1 


f 


t 


Middle ($25,000-$74,999) 


24.4 


24.1 


2.07 


3.99 


High ($75,000 or more) 


18.3 


20.0 


-2.03 


4.75 


Parents’ educational level 


High school or less 


24.5 


23.9 


t 


t 


Some college 


25.2 


23.7 


-0.19 


4.76 


College graduate 


21.1 


21.9 * 


-1.96 


4.65 


Number of risk factors** 


None 


21.0 


30.1 


t 


t 


One 


28.0 


21. \ * 


5.10 


3.02 


Two 


16.3 


9.9 * 


-12.11 


5.97 


Three or more 


23.0 


20.5 


-1.46 


12.96 



*p<.05. 

**Risk factors are low SES quartile, average grades of Cs or lower from sixth to eighth grade, changed schools two or more 
times other than normal progression, lived in a single-parent family, had one or more older siblings who dropped out of high 
school, or held back a grade before. 
tNot applicable for the reference group. 

*The italicized group in each category is the reference group being compared. 

^The estimates are from the NEB Data Analysis System. 

^The percentages are adjusted for differences associated with other variables in the table (see appendix B). 

‘^Ordinary least squares (OLS) coefficient, multiplied by 100 to reflect percentage (see appendix B). 

^Standard error of OLS coefficient, adjusted for design effect, multiplied by 100 to reflect percentage (see appendix B). 

SOURCE: U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 
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been in the top 10 percent of his or her high school class, but still have relatively low SAT 
scores. For example, within the categories of “highly qualified” and “very highly qualified” there 
was significant variation among racial-ethnic groups in their mean SAT scores (figure 7). Black 
highly qualified students had an average SAT score 151 points lower than white highly qualified 
students, while black very highly qualified smdents had SAT scores 166 points below those of 
very highly qualified white students. These differences in qualifications may be confounding the 
association of race-ethnicity with persistence and attainment among highly qualified students. 

To further control for this variation in academic background, individual SAT scores are 
added as a control variable in tables 10 and 1 1 (while not actually presented as a row variable).22 



Figure 7 — ^Average SAT scores for 1992 college-qualified* high school graduates by race-ethnicity and level 
of college qualifications: 1994 



■ Asian or Pacific Islander BHispanic DBIack non-Hispanic B White non-Hispanic | 




Minimally qualified Somewhat qualified Highly qualified Very highly qualified 

College qualifications 



*At least minimally qualified. 

NOTE: Percentages may not sum to 100 due to rounding. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, National Education Longitudinal Study and 
Beginning Postsecondary Students Longitudinal Study Linkage Data File (NEB), Data Analysis System. 



22 



About 30 percent of the sample did not have SAT scores and were eliminated from this part of the analysis. 
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The percentage of highly or very highly qualified black students projected to have finished a 
bachelor’s degree after 4 years was no longer statistically different from the rate for whites (table 
10) (although it was significant at the 0.10 level). However, the percentage of black students 
projected to no longer be enrolled without obtaining a degree was still significantly higher than 
the percentage for whites, even after holding individual SAT scores constant (table 11). 



Controlling for Related Variables 



Table 10 — Percentage of highly or very highly college-qualined 1992 high school graduates who 

enrolled in a 4-year college, projected to complete the bachelor’s degree, and the adjusted 
percentage after taking into account the covariation of the variables listed in the table and 
SAT score’ 




Unadjusted 

percentage^ 


Adjusted 

percentage^ 


OLS 

coefficient'’ 


Standard 

error^ 


Total 


32.5 


32.7 


2.71 


10.50 


Sex 










Male 


23.9 


22.7 


t 


t 


Female 


40.2 * 


41.4 * 


18.62 


3.23 


Race-othnicity 










White, non-Hispanic 


32.9 


32.9 


t 


t 


Black, non-Hispanic 


13.8 * 


16.8 


-16.04 


8.28 


Hispanic 


33.6 


40.0 


7.17 


7.35 


Asian/Pacific Islander 


37.4 


34.6 


1.79 


6.17 


Family income 










Low (less than $25,000) 


32.4 


34.0 


t 


t 


Middle ($25,(XK)-$74,999) 


28.6 


29.4 


-4.66 


4.85 


High ($75,000 or more) 


40.0 


39.0 


5.02 


5.75 


Parents* educational level 










High school or less 


27.7 


30.9 


t 


t 


Some college 


30.2 


33.0 


2.06 


5.77 


College graduate 


34.2 


32.8 


1.91 


5.73 


Number of risk factors^^ 










None 


33.4 


33.0 


t 


t 


One 


30.9 


31.1 


-1.93 


3.65 


Two 


29.0 


35.0 


1.97 


7.24 


Three or more 


36.5 


39.1 


6.08 


15.74 








OLS 


Standard 




Mean 




coefficient 


error 


SAT score 


1062 




0.00019 


0.00006 


♦p < .05. 



♦♦Risk factors are low SES quartile» average grades of Cs or lower from sixth to eighth grade, changed schools two or more 
times other than normal progression, lived in a single-parent family, had one or more older siblings who dropped out of high 
school, or held back a grade before. 

tNot applicable for the reference group. 

*The italicized group in each category is the reference group being compared. 

^e estimates are from the NEB Data Analysis System. 

^e percentages are adjusted for differences associated with other variables in the table (see appendix B). 

"^Ordinary least squares (OLS) coefficient, multiplied by 1(X) to reflect percentage (see appendix B). 

^Standard error of OLS coefficient, adjusted for design effect, multiplied by 1(X) to reflect percentage (see appendix B). 

SOURCE: U.S. Department of Education, National Center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 
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Table 11 — Percentage of highly or very highly college-qualified 1992 high school graduates Mho enrolled 

in a 4-year college, projected to have no degree and not be enrolled and the adjusted percentage 
after taking into account the covariation of the variables listed in the table and SAT score 





Unadjusted 

2 

oercentaee 


Adjusted 

percentage^ 


OLS 

coefficient^ 


Standard 

5 

error 


Total 


22.6 


22.7 


38.18 


10.42 


Sex 


Male 


24.0 


25.3 


f 


f 


Female 


21.3 


20.4 


-4.88 


3.20 


Race-ethnicity 


White, non-Hispanic 


20.6 


20.8 


f 


t 


Black, non-Hspanic 


45.5 * 


44.1 * 


23.34 


8.22 


I^spanic 


32.9 


31.4 


10.60 


7.29 


Asian/Piacific Islander 


22.7 


24.6 


3.87 


6.13 


Family income 


Low (less than $25,000) 


25.2 


22.4 


f 


f 


Middle ($25,00Q-$74,999) 


24.4 


23.1 


1.31 


4.81 


Hgh ($75,000 CH- more) 


18.3 


20.8 


-1.54 


5.71 


Parents* educational level 


High school or less 


24.5 


22.5 


f 


f 


Some college 


25.2 


23.1 


0.62 


5.73 


Cbllege graduate 


21.1 


22.6 


0.07 


5.68 


Number of risk factors** 


None 


21.0 


30.1 


f 


f 


One 


28.0 


27.0 


4.97 


3.62 


Two 


16.3 


8.8 


-13.26 


7.19 


Three or more 


23.0 


23.1 


1.05 


15.62 




Mean 




OLS 

coefficient 


Standard 

error 


SAT score 


1062 




-0.00015 


0.00005 



♦p<.05. 

**Risk factors are low SES quaitile, average grattes of Cs or lower fiom sixth to dghth grade, changed schools two or more 
times other than nornial progression, lived in a single-parent family, had one or more older siblings who dropped out of high 
sdiool, or held back a grade before. 
tNot applicable for the reference group. 

^Ihe italicized group in each category is the reference group being compared 
^The estimates are fiom the NEB Data Analysis Systan. 

■^he perooitages are adjusted for differences associated with other variables in tbe table (see appendix B). 

"^Ordinary least squares (OLS) coefficient, multiplied by 1(X) to reflect percentage (see appendix B). 

^Standard error of OLS coefficient, adjusted for design effect, multiplied by 1(X) to reflect percentage (see appendix B). 

SOURCE; U.S. Department of Education, National (center for Education Statistics, merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis Systan. 
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Summary 



It is well known that there are overall disparities in the college enrollment rates of students 
from different socioeconomic and racial-ethnic backgrounds. Low-income students and under- 
represented minority students enroll in 4-year colleges and universities at lower rates than do 
white students. Research by Berkner and Chavez cited in this report has demonstrated that if stu- 
dents are college qualified and if they take the appropriate steps that lead to enrolling in a 4-year 
college or university, these differences in enrollment rates diminish. These findings were re- 
peated in the beginning sections of this report. 

The contribution of this report was to project, to 4 years after high school completion, the 
persistence and attainment rates for those college-qualified students in the high school class of 
1992 who enrolled in a 4-year institution in the 1992-93 academic year. Generally, 4 years later, 
outcomes were similar for college-qualified students despite initial differences in demographic 
backgrounds. Among all college-qualified students, those who had a parent with a college degree 
were more likely to have persisted and to have attained a degree, but this appeared to be the re- 
sult of the higher overall college qualifications of students whose parents had a college educa- 
tion. Among students who were minimally or somewhat qualified and again among students who 
were highly or very highly qualified, parental education was not associated with higher projected 
persistence or attainment rates. 

The one comparison that stood out in this analysis was the difference between white and 
black projected persistence and attainment rates. While the statistical methods that were em- 
ployed were quite simple, none of the statistical controls that were applied to the data had the ef- 
fect of eliminating these differences. Controlling for parents’ education, family income, and the 
student’s SAT scores did not significantly affect this basic finding. This finding does conform to 
results found by other researchers that even highly qualified black college students graduate at 
lower rates than their white peers. For example, Vars and Bowen found that in the 11 selective 
schools they studied, black students at every level of SAT score had lower grades and graduated 
at lower rates than comparable white students.23 They also found that the performance gap be- 
tween whites and blacks was greatest for those at the highest SAT score levels. 



^^F.E. Vars and W. Bowen, “Scholastic Aptitude Test Scores, Race, and Academic Performance in Selective Colleges and Uni- 
versities*” in C. Jencks and M. Phillips (eds.)» The Black-White Test Score Gap (Washington* D.C.: Brookings Institution Press* 
1998 ). 
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Summary 



However, because the present study examined projected persistence and attainment rates 
and the methods used to analyze the data were basically descriptive in nature, this study can only 
suggest conclusions about actual attainment rates and persistence rates. That our results conform 
to results found elsewhere should give one confidence that our projections are accurate in the ag- 
gregate. 



Appendix A — Glossary 



This glossary describes the variables used in this report. The items were taken directly from 
the National Education Longitudinal Study: 1988-94 and Beginning Postsecondary Student 
Study: 1989-90 Linkage Data File Data Analysis System (DAS) (see appendix B for a descrip- 
tion of the DAS). These variables were either items taken directly from the NELS or BPS sur- 
veys or they were derived by combining one or more items in these surveys. Unless otherwise 
noted, the variable labels are those appearing in the DAS. 

Glossary Index 



Applied to a 4-year institution APP4YR 

Weight for BPS imputed data BPSLNKWT 

Number of risk factors 1988 BYRISK2 

4-year college qualification composite .... CQCOMV2 
Ever took S AT/ACT/applied to 4-year 

college (composite) EXMAPLY2 

1992 NELS test percentile F22XCEN 

Family income in 1991 F2P74 

Parents' highest education level 1992 F2PARED 

ACT composite 1992 F2RACTC 

Weight for 1992 high school graduates ...F3QWT92G 

Race-ethnicity 1994 F3RACE 

High school class rank percentile 1992 ...F3RANK_C 

Type of first postsecondary institution F3SEC2A1 

Gender as of 1994 F3SEX 

High school GPA (high school 

transcripts) GPA 

Postsecondary persistence and 

attainment in academic year 1989-90 P8990 

Postsecondary persistence and 

attainment in academic year 1990-91 P9091 

Postsecondary persistence and 

attainment in academic year 1991-92 P9192 

Poslsecondary persistence and 
attainment in academic year 1992-93 P9293 



Took first step in PSE pipeline PIPEl 

Took at least 2 sequential steps in PSE 

pipeline PIPE2 

Took at least 3 sequential steps in PSE 

pipeline PIPE3 

Took at least 4 sequential steps in PSE 

pipeline PIPE4 

Took all 5 sequential steps in PSE 

pipeline PIPE5 

Delayed PSE enrollment (matched to 

BPS) PSEDELAY 

Enrollment status first PSE institution 

(matched to BPS) PSEFTPT 

Institution type of first PSE enrollment PSETYPE 

Combined SAT score SAT 

High attainment and persistence status 

in AY89-90 STS8990 

Highest attainment and persistence 

status in AY90-91 STS9091 

Highest attainment and persistence 

status in AY91-92 STS9192 

Highest attainment and persistence 

status in AY92-93 STS9293 

Ever took SAT/ACT TOOKEXAM 



BEST COPY AVAllA^i^i 
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Applied to a 4-year institution 



APP4YR 



This derived variable describes whether or not a respondent ever applied to a 4-year institution. It is based primarily 
on student-reported data. 



Respondent applied to a 4-year 
postsecondary institution. 

Respondent did not apply to a 4-year 
postsecondary institution. 



Weight for BPS imputed data 



BPSLNKWT 



Weight created for the BPS/NELS study where BPS persistence variables are used to project persistence and attain- 
ment by NELS participants over a five-year time period. 



Number of risk factors 1988 BYRISK 

Number of risk factors 1988. The sum of 6 possible risk factors determined to increase the chances of dropping out 
of high school after controlling for SES and race/ethnicity. Note that this variable is different from B YATRISK All 
of the risk factors were identified in 8th grade with the exception of students having older siblings who dropped out 
of high school, which was asked in the tenth grade. If a student had missing data for 2 or more risk items, the vari- 
able was set to missing. Risk factors are as follows: 1. Low SES (BYSES); 2. Single parent family (BYFCOMP); 3. 
Older sibling dropped out (F1S94); 4. Changed schools 2 or more times (BYP40) (reported by the parent)- 5 Aver- 
age grades of Cs or lower from 6th to 8th grades (BYGRD68); 6. Repeated an earlier grade (BYS74). 

None 

One 

Two 

Three 

Four 

Five 

Six or more 

Missing, legitimate skip 



4-year college qualification composite CQCOM V 

For this study, a composite measure of 4-year college readiness or qualification index was developed that uses cu- 
mulative academic coursework CPAs, senior class rank, the NELS 1992 test scores, and the SAT and ACT college 
enu-ance examination scores. Since admission standards and requirements vary widely among 4-year colleges and 
universities, the approach used here was to examine the actual distribution of these five measures of academic apti- 
tude and achievement among those graduating seniors who did attend a 4-year college or university. Data sources 
were available for approximately half (45 percent) of the NELS graduating seniors for four or five of the criteria: 
class rank, GPA, the NELS test, and ACT or SAT scores or both. For about one-third of the seniors there were only 
three data sources available because they had no ACT or SAT scores. All of these had NELS test scores, however. 
In order to identify as many students as possible who were potentially academically qualified for a 4-year college, 
even if data were missing for these students on some of the criteria, the seniors were classified according to the 
highest level they had achieved on any of the five criteria for which data were present. 

The initial classification of the graduating seniors was determined as follows: 

• Very highly qualified: those whose highest value on any of the fivexriteria would, put them among the top 10 
percent of 4-year college students (specifically the NELS 1992 graguatihg=^ehiors who erifai${Th74%ear col- 
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leges and universities) for that criterion. Minimum values were GPA=3.7, class rank percentile=96, NELS test 
percentile=97, combined SAT=1250, composite ACT=28. 

• Highly qualified: those whose highest value on any of the five criteria would put them among the top 25 percent 
of 4-year college students for that criterion. Minimum values were GPA=3.6, class rank percentile=89, NELS 
test percentile=90, combined SAT=1 1 10, composite ACT=25. 

• Somewhat qualified: those whose highest value on any of the five criteria would put them among the top 50 
percent (i.e., in the second quartile) of 4-‘year college students for that criterion. Minimum values were 
GPA=3.2, class rank percentile=75, NELS test percentile=76, combined SAT=960, composite ACT=22. 

• Minimally qualified: those whose highest value on any of the five criteria would put them among the top 75 
percent (i.e., in the third quartile) of 4-year college students for that criterion. Minimum values were GPA=2.7, 
class rank percentile=54, NELS test percentile=56, combined SAT=820, composite ACT=19. [Those in voca- 
tional programs (according to their high school transcript) were classified as not college qualified.] 

• Marginally or not qualified: those who had no value on any criterion that would put them among the top 75 per- 
cent of 4-year college students (i.e., all values were in the lowest quartile). 

Then some adjustments were made for programs of rigorous academic course work, defined as including at least 4 
years of English; 3 years each of science, mathematics, and social studies; and 2 years of a foreign language. 

Those who had taken a program of rigorous academic courses were moved up into one higher level of qualification: 
and 

Those in the “very highly qualified” category were moved down into the “highly qualified” category if they had not 
taken the rigorous academic coursework; 

Marginally or not qualified Student was marginally or not qualified to attend a 4- 

year college. 

Student was minimally qualified to attend a 4-year 
college. 

Student was somewhat qualified to attend a 4-year 
college. 

Student was highly qualified to attend a 4-year col- 
lege. 

Student was very highly qualified to attend a 4-year 
college. 

For some tables, students were identified as “college qualified” if they were other than “marginally or not qualified” 
for the variable CQCOMV2. 



Minimally qualified 
Somewhat qualified 
Highly qualified 
Very highly qualified 
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Ever took SAT/ACT/applied to 4-year college (composite) EXMAPLY2 

This variable resolves the inconsistencies in student reporting of where they applied and where they were first en- 
rolled (EXMAPLYl). The specific purpose of this composite variable is to consider type of institution first attended 
(PSEFIRTY). Students who attended a 4-year institution (as revealed in PSEFIRTY) probably applied to such an 
institution. Therefore, students who indicated that the first type of postsecondary institution they enrolled in was a 4- 
year institution were considered to have applied to at least one 4-year institution for the purpose of determining the 
steps students must take in order to attend a 4-year college. First, EXMAPLY2 was set equal to EXMAPLYl. If 
EXMAPLYl was equal to *2 (student took exam but did not apply) and they indicated that their first type of post- 
secondary institution was a 4-year institution, EXMAPLY2 was set equal to 1 (took exam and applied). Similarly, if 
EXMAPLYl was equal to 4 (student did not take an exam nor did they apply) and they indicated that their first type 
of postsecondary institution was a 4-year institution, EXMAPLY2 was set equal to 3 (did not take exam but ap- 
plied). 



Took exam and applied 
Took exam, did not apply 
Did not take exam, applied 
Did neither 

Missing, legitimate skip 



1992 NELS test percentile F22XCEN 

The source for this variable was the composite variable consisting of math and reading NELS Second Follow-up test 
scores. To create the derived variable, a ranking was created by first calculating a weighted frequency distribution of 
test composite variables. Next, cutoff points were determined and numbered sequentially from 1 to 99. 



Family income in 1991 F2P74 

In 1992, parents were asked, “What was your total gross family income from all sources before taxes in 1991 (If you 
are not sure of the amount, please estimate)?” For purposes of this report, the original 13 income categories were 
collapsed into three. 



Low (less than $25,0(X)) 


Family 


Middle ($25,0(X>-$74,999) 


Family 


High ($75,000 or higher) 


Family 



income was less than $25,000. 

income was between $25,000 and $74,999. 

income was $75,000 or higher. 



Parents* highest education level 1992 F2PARED 

This composite characterizes the level of education attained by the parent with the highest reported education level. 

It was constructed using second follow-up parent questionnaire data; data from earlier survey waves were used if 
data were missing. For purposes of this report, levels of education were collapsed to create three categories. 

High school graduate or less Highest level of parental education was either high 

school graduation or less than high school graduation. 

Some college Highest level of parental education was greater than 

high school and less than 4-year degree. 

College graduate Highest level of parental education was college 

graduate, M.A. or equivalent, or Ph.D., M.D., or 
other professional degree. 




• Ju 
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ACT composite 1992 

American College Test (composite score). The valid range for this test score is 1 to 36. 



F2RACTC 



Weight for 1 992 high school graduates F3QWT92G 

F3QWT92G contains F3QWT for respondents who received a high school diploma between September 1, 1991 and 
August 31, 1992 or respondents whose diploma receipt date is not known but who began their postsecondary educa- 
tion during the period of June 1992 through October 1992. It allows projections to the population of persons who 
received a high school diploma during those time periods and were eligible to complete questionnaires in 1992 and 
1994. 



Race-ethnicity 1 994 ^ .SKAt.,11, 

Frequencies for this variable were created with F3QWT. “What is your racial or ethnic background? Interviewer: If 
necessary, probe by reading response categories. 

Asian or Pacific Islander 
Hispanic, regardless of race 
Black, not of Hispanic origin 
White, not of Hispanic origin 
American Indian or Alaskan Native 
Other 

1992 data for this variable were preloaded into the CATI instrument. The question was asked only in instances 
where the 1992 value was missing. If it became apparent from responses to other questions that the preloaded value 
was incorrect, this variable’s value was corrected. This question was not asked at all in the hard copy questionnaire 
and 1992 values were used for these sample members. Sample members with the value of Other were assigned the 
value -1 for this data file. 

Asian or Pacific Islander 
Hispanic regardless of race 
Black not of Hispanic origin 
White not of Hispanic origin 
American Indian or Alaskan Native 
Missing, legitimate skip 

High school class rank percentile 1992 F3RANK_C 

Frequencies for this variable were calculated with F3QWT. F3RANK_C is a measure of a student’s class rank dur- 
ing the last year he or she attended high school. It draws on F2RRANK (class rank for last year attended) and 
F2RCSIZE (class size for last year attended) from the 1992 transcript data and the 1994 cross sectional weight, 
F3QWT. It is calculated by dividing the class rank by the class size and subtracting the result from 1. The weighted 
distribution of these values is determined using F3QWT and percentiles assigned. The valid values of the percentiles 
ranges from 1-99, with the first 1.5 percent assigned the value 1 and the highest 1.5 percent assigned the value of 
99. The value of 99 thus indicates the highest-ranking students and the value of 1 indicates the lowest. 
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Type of first postsecondary institution F3SEC2A1 

Frequencies for this variable were created with F3QWT. This variable contains the institution type associated with 
the first institution attended. The primary source is the SECTOR variable in the 1993/94 Integrated Postsecondary 
Education Data System (IPEDS) data file. SECTOR is recoded to 6 values for this variable. For the few instances 
where SECTOR is missing, the variable CONTROL from the same file is used. 

None 

Private, for-profit 

Private, not-for-profit less-than-4-year 
Public less-than-2-year 
Public 2-year 

Private, not-for-profit 4-year 
Public 4-year 
Missing, legitimate skip 



Gender as of 1994 F3SEX 

Frequencies for this variable were created with F3QWT. 1992 data for gender were preloaded into the CATI instru- 
ment and displayed to the interviewer. There were no missing values in the 1992 data and the question was never 
asked. In a few instances it became apparent to the interviewer that the 1992 value was incorrect and the value was 
corrected. The hard copy questionnaire did not include this question and 1992 values were used for the hard copy 
respondents. 

Male 

Female 



High school GPA (high school transcripts) GPA 

This variable is the overall grade point average on a 4.0 scale for all courses taken for a grade. It is multiplied by 100 
for file purposes. 



Postsecondary persistence and attainment in academic year 1989-90 P8990 

The difference between this variable and PER8990R is that this variable further distinguishes those who attained a 
certificate/degree in AY89-90 and re-enrolled in PSE later (e.g., in AY90-91, AY91-92, or AY92-93) from those 
who attained a certificate/degree in AY89— 90 and left without re-enrolling. Note: the status of academic year of 
1993-94 (PER9394R) was not considered in making this variable. 

Attained AA in AY89-90+re-enrolled later 
Attained BA in AY89-90+re-enrolled later 
Persisted to AY90-91 
Downward/delayed transfer in AY89-90 
Stopout in AY89-90 
Left without return in AY89-90 
Attained cert, in AY89-90+no further PSE 
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Postsecondary persistence and attainment in academic year 1990-91 P9091 

The difference between this variable and PER9091R is that this variable further distinguishes those who attained a 
certificate/degree in AY90-91 and re-enrolled in PSE later (e.g., in AY91-92 or AY92-93) from those who attained 
a certificate/degree in AY90-91 and then left without re-enrolling. In addition, those identified as not enrolled, 
downward, delayed transfer, stopout, left, or attained a certificate/degree without further PSE in AY89-90 were 
coded as “Skip.” Note: the status of academic year of 1993-94 (PER9394R) was not considered in making this vari- 
able. 



Not enrolled 

Attained cert, in AY90-91+re-enrolled later 

Attained AA in AY90-91+re-enrolled later 

Attained BA in AY90-91+re-enrolled later 

Downward/delayed transfer in AY90-91 

Persisted to AY91-92 

Stopout in AY90-91 

Left without return in AY90-91 

Attained cert, in AY90-91+no further PSE 

Attained AA in AY90-91+no further PSE 

Missing 



Postsecondary persistence and attainment in academic year 1991-92 P9192 

The difference between this variable and PER9192R is that this variable further distinguishes those who attained a 
certificate/degree in AY91-92 and re-enrolled in PSE later (e.g., in AY92-93) from those who attained a certifi- 
cate/degree in AY92-93 and left without re-enrolling. In addition, those identified as not enrolled, downward, de- 
layed transfer, stopout, left, or attained a certificate/degree without further PSE in AY89-90 or AY90-91 were 
coded as “Skip.” Note: the status of academic year of 1993-94 (PER9394R) was not considered in making this vari- 
able. 



Not enrolled 

Attained cert, in AY91-92+re-enrolled later 
Attained AA in AY91-92+re-enrolled later 
Attained BA in AY91-92+re-enrolled later 
Down ward/delayed transfer in AY91-92 
Persisted to AY92-93 
Stopout in AY91-92 
Left without return in AY91-92 
Attained cert, in AY91-92+no further PSE 
Attained AA in AY91-92+no further PSE 
Attained BA in AY91-92+no further PSE 
Missing 



Postsecondary persistence and attainment in academic year 1992-93 P9293 

All students who were identified as not enrolled, downward/delayed transferred, stopout, left without return or at- 
tained a certificate/degree without further PSE in AY89-90, 90-91, or 91-92 were coded as “Skip.” 

Attained cert, in AY92-93 
Attained associate’s degree in AY92-93 
Attained bachelor’s degree in AY92-93 
Persisted to AY93-94 
Downward/delayed transfer in AY92-93 
Stopout in AY92-93 
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Left without return in AY92-93 
Missing 



Took first step in PSE pipeline PIPEl 

The pipeline is a cumulative sequence of 5 steps usually taken to enroll in a 4-year college (see PIPELINE for a de- 
tailed description) including: 

10th grade aspirations for a bachelor’s degree [see F1S49] 

Academic preparation [see CQCOMVl] 

Entrance exams [see EXMAPLY2] 

Application to college [see EVR4YRA] 

Enrollment in a 4-year college [see F3SEC2A1, if missing ENST1092] 

PIPEl identifies all those who took at least the first step which is to have 10th grade aspirations for a bachelor’s de- 
gree. 



No 

Yes 

Missing, legitimate skip 

Took at least 2 sequential steps in PSE pipeline PIPE2 

The pipeline is a cumulative sequence of 5 steps usually taken to enroll in a 4-year college (see PIPELINE for a de- 
tailed description) including: 

10th grade aspirations for a bachelor’s degree [see F1S49] 

Academic preparation [see CQCOMVl] 

Entrance exams [see EXMAPLY2] 

Application to college [see EVR4YRA] 

Enrollment in a 4-year college [see F3SEC2A1, if missing ENST1092] 

PIPE2 identifies all who took at least the first 2 steps: having a bachelor’s degree goal and being academically pre- 
pared. 



No 

Yes 

Missing, legitimate skip 



Took at least 3 sequential steps in PSE pipeline PIPE3 

The pipeline is a cumulative sequence of 5 steps usually taken to enroll in a 4-year college (see PIPELINE for a de- 
tailed description) including: 

10th grade aspirations for a bachelor’s degree [see F1S49] 

Academic preparation [see CQCOMVl] 

Entrance exams [see EXMAPLY2] 

Application to college [see EVR4YRA] 

Enrollment in a 4-year college [see F3SEC2A1, if missing ENST1092] 
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PIPES identifies all who took at least the first 3 sequential steps: having a bachelor’s degree goal, being academi- 
cally prepared, and taking entrance exams. 

No 

Yes 

Missing, legitimate skip 



Took at least 4 sequential steps in PSE pipeline PIPE4 

The pipeline is a cumulative sequence of 5 steps usually taken to enroll in a 4-year college (see PIPELINE for a de- 
tailed description) including: 

10th grade aspirations for a bachelor’s degree [see F1S49] 

Academic preparation [see CQCOMVl] 

Entrance exams [see EXMAPLY2] 

Application to college [see EVR4YRA] 

Enrollment in a 4-year college [see F3SEC2A1, if missing ENST1092] 

PIPE4 identifies all who took at least the first 4 sequential steps: having a bachelor’s degree goal, being academi- 
cally prepared, taking entrance exams, and applying to a 4-year college. 

No 

Yes 

Missing 

Took all 5 sequential steps in PSE pipeline PIPES 

The pipeline is a cumulative sequence of 5 steps usually taken to enroll in a 4-year college (see PIPELINE for a de- 
tailed description) including: 

10th grade aspirations for a bachelor’s degree [see F1S49] 

Academic preparation [see CQCOMVl] 

Entrance exams [see EXMAPLY2] 

Application to college [see EVR4YRA] 

Enrollment in a 4-year college [see F3SEC2A1, if missing ENST1092] 

PIPES identifies those who took all 5 steps and successfully enrolled in a 4-year college. 

No 

Yes 

Missing, legitimate skip 



Delayed PSE enrollment (matched to BPS) PSEDELAY 

This variable was created by two variables in the third follow-up of NELS: 1) high school completion date 
(F3HSCPDT) and 2) first PSE enrollment date (PSEFIRDT) in the third follow-up of NELS. A value of 1 means 
that a student did not enroll in PSE within one year of high school graduation. A value of 2 means that a student en- 
rolled in PSE within one year of high school graduation. 

No 

Yes 

Missing, blank 
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Enrollment status first PSE institution (matched to BPS) PSEFTPT 

This variable was recoded from PSEFIRST in the third follow-up of NELS. 

Full-time 
Part-time 
Missing, blank 



Institution type of first PSE enrollment PSETYPE 

This variable was recoded from PSEFIRTY in the third follow-up of NELS. Students attending institutions identi- 
fied as Private, for-profit or Private, not-for-profit less than 4-year were excluded from this study. 

Public 2-year 

Private, not-for-profit 4 year 
Public 4-year 
Missing, blank 



Combined SA T score SAT 

The combined scores of the Scholastic Assessment Test (SAT) verbal and math scores. The valid range is 400 to 
1600. 



Highest attainment and persistence status in A Y89-90 STS8990 

This variable identifies the highest attainment level and persistence status in AY89-90. 

Attained certificate 
No degree but enrolled 
No degree and not enrolled 
Missing/skip 



Highest attainment and persistence status inAY90-91 STS9091 

This variable identifies the highest attainment level and persistence status in AY90-91. 

Attained certificate 
Attained associate’s degree 
No degree but enrolled 
No degree and not enrolled 
Missing/skip 



Highest attainment and persistence status in A Y91-92 STS9192 

This variable identifies the highest attainment level and persistence status in AY91-92. 

Attained certificate 
Attained associate’s degree 
Attained bachelor’s degree 
No degree but enrolled 
No degree and not enrolled 
Missing/skip 
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Highest attainment and persistence status in AY92-93 

This variable identifies the highest attainment level and persistence status in AY92-93. 

Attained certificate 
Attained associate’s degree 
Attained bachelor’s degree 
No degree but enrolled 
No degree and not enrolled 
Missing/skip 



Ever took SAT/ ACT 

This variable was recoded from EXMAPLY2. 

No 

Yes 

Missing, legitimate skip 
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The NELS:88/94 and BPS:89/90 Linkage Data 

The NELS:88/94 and BPS:89/90 linkage file was created to provide a set of variables from 
the NELS and the BPS datasets that would allow inferences to be made about the students in one 
dataset, based on the characteristics of students in the other survey. For example, there are high 
school achievement measures (e.g. qualifications for college) in the NELS survey but there are 
no long-term postsecondary outcome measures in the NELS survey. BPS data can be used to 
supplement the NELS data to obtain estimates of these long-term postsecondary outcomes. 

The two datasets share a fairly extensive set of common variables describing the back- 
ground characteristics of sample members. These characteristics were used to link students in 
each of the datasets, thereby creating a type of “synthetic cohort,” to provide values for variables 
missing in the NELS dataset. In a sense, the data on long-term postsecondary outcomes in NELS 
(due to the censoring of the data 2 years after high school graduation), were treated as a huge 
missing data problem. These missing data were imputed for the NELS students using multiply 
imputation methods developed by Rubin, and later expanded upon by Rubin and Little. 

As a first step in this process we created the primary imputation cells from three variables 
of students’ early postsecondary experiences: 1) type and level of first PSE institution; 2) timing 
of PSE entry (immediate or delayed); and 3) enrollment status (part-time vs. full-time). Within 
each of these cells, we performed a crosstabulation based on students’ sex, race, and socioeco- 
nomic status. Because in each imputation cell, a minimal of 5 BPS donors was required for each 
NELS recipient, it was clear that cells defined by crossing students’ sex, race, and SES in most 
of the primary cells needed to be collapsed in order to reach this requirement. As a result, a total 
of 67 imputation cells in BPS were created. 

The imputation was performed through hotdecking procedures in the computer program 
PROC IMPUTE developed by the American Institutes for Research. In this procedure a NELS 
student was matched with a BPS student with replacement (WR), and the data on postsecondary 
outcomes for the BPS student were assigned to the NELS student. This procedure was performed 
36 times to capture the measurement error associated with the imputations. A disturbance term 

Rubin, Multiple Imputation for Nonresponse in Sample Surveys, 1987, Wiley and Sons; R.J.A. Little and D.B. Rubin, 
Statistical Analysis With Missing Data, 1987» Wiley and Sons. 
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was added to the NELS sample weights to reflect this added source of error in the data. This 
disturbance term inflated the weight (BPSLNKWT) so that the variance component of the esti- 
mates associated with the size of the weight was also inflated — ^accurately inflating the total vari- 
ance of the estimate to account for measurement error due to imputation. 

Accuracy of Estimates 

The statistics in this report are estimates derived from a sample. Two broad categories of 
error occur in such estimates: sampling and nonsampling errors. Sampling errors occur because 
observations are made only on samples of students, not on entire populations. Nonsampling er- 
rors occur not only in sample surveys but also in complete censuses of entire populations. Non- 
sampling errors can be attributed to a number of sources: inability to obtain complete 
information about all students in all institutions in the sample (some students or institutions re- 
fused to participate, or students participated but answered only certain items); ambiguous defini- 
tions; differences in interpreting questions; inability or unwillingness to give correct information; 
mistakes in recording or coding data; and other errors of collecting, processing, sampling, and 
imputing missing data. 



Data Analysis System 

The estimates presented in this report were produced using the merged data from the Na- 
tional Education Longitudinal Study and Beginning Postsecondary Longitudinal Study (NEB) 
Data Analysis System (DAS). The DAS software makes it possible for users to specify and gen- 
erate their own tables from the NEB data. With the DAS, users can replicate or expand upon the 
tables presented in this report. In addition to the table estimates, the DAS calculates proper stan- 
dard errors^s and weighted sample sizes for these estimates. For example, table B1 contains stan- 
dard errors that correspond to table 1 in the text, and was generated by the DAS. If the number of 
valid cases is too small to produce a reliable estimate (fewer than 30 cases), the DAS prints the 
message “low N” instead of the estimate. 

All variables in the DAS are integers. To preserve decimal precision for certain variables, 
values in the DAS have been multiplied by a factor of 10. For example, grade point averages 
range from 0—400 in the DAS. When averages for such variables appear in this report, estimates 
and standard errors were divided by the appropriate factor of 10 to restore the original scale. 



^^The NELS and BPS samples are not simple random samples and, therefore, simple random sample techniques for estimating 
sampling error cannot be applied to these data. The DAS takes into account the complexity of the sampling procedures and cal- 
culates standard errors appropriate for such samples. The method for computing sampling errors used by the DAS involves ap- 
proximating the estimator by the linear terms of a Taylor series expansion. The procedure is typically referred to as the Taylor 
series method. 
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Table B1 — Standard errors for Table 1: Percentage of 1992 high school graduates who took the 
steps toward 4-year college enrollment, by selected characteristics^ 





lOth-grade 

bachelor’s 

degree 

aspirations 


Minimally 

prepared 

academically 


Took SAT 
and/or 
ACT 


Applied 
to a 4-year 
institution 


Enrolled 
in a 4-year 
institution 
by 1994 


Total 


0.7 


0.8 


0.8 


0.9 


0.8 


Sex 


Male 


1.1 


1.1 


1.1 


1.1 


1.1 


Female 


0.9 


1.0 


1.0 


1.0 


1.0 


Race-ethnicity 


White, non-Hispanic 


0.8 


0.9 


0.9 


1.0 


1.0 


Black, non-Hispanic 


2.1 


2.3 


2.2 


2.3 


2.0 


Hispanic 


2.0 


1.9 


1.9 


1.9 


1.8 


Asian Pacific Islander 


3.0 


3.4 


3.4 


3.3 


3.1 


American Indian/Alaskan Native 


5.7 


5.0 


5.1 


4.7 


4.4 


Family income 


Low (less than $25,000) 


1.2 


1.2 


1.2 


1.1 


1.1 


Middle ($25,000-$74,999) 


0.9 


1.0 


1.0 


1.0 


1.0 


High ($75,000 or more) 


1.2 


1.5 


1.5 


1.6 


1.8 


Parents’ educational level 


High school or less 


1.3 


1.2 


1.2 


1.1 


1.0 


Some college 


0.9 


1.0 


1.0 


1.1 


1.0 


College graduate 


0.9 


1.1 


1.2 


1.2 


1.3 


Number of risk factors^ 


None 


0.8 


0.9 


0.9 


1.0 


1.1 


One 


1.2 


1.2 


1.2 


1.2 


1.2 


Two 


1.6 


1.5 


1.4 


1.3 


1.2 


Three or more 


2.4 


2.5 


2.6 


2.6 


1.0 



‘The proportion of students at each step is based on those who successfully completed all the preceding steps. 

^Risk factors are low SES quartile. average grades of Cs or lower from sixth to eighth grade, changed schools two or more times 
other than normal progression, lived in a single-parent family, had one or more older siblings who dropped out of high school, 
or held back a grade before. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Merged data from the National 
Education Longitudinal Study and the Beginning Postsecondary Student Longitudinal Study (NEB), Data Analysis System. 



In addition to tables, the DAS will also produce a correlation matrix of selected variables to 
be used for linear regression models. Included in the output with the correlation matrix are the 
design effects (DEFTs) for each variable in the matrix. Since statistical procedures generally 
compute standard errors based on an assumption of simple random sampling, the standard errors 
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must be adjusted with the design effects to take into account NELS:88/94 and BPS:89/90’s com- 
plex sample design. (See discussion under “Statistical Procedures” below for the adjustment pro- 
cedure.) 

For more information about the NEB and other Data Analysis Systems, consult the NCES 
DAS website (www.nces.ed.gov/das) or its West Coast mirror site (www.pedar-das.org), or 
contact: 

Aurora D’Amico 

NCES Postsecondary Studies Division 
555 New Jersey Avenue, NW 
Washington, DC 20208-5652 
(202)219-1365 

Internet address: Aurora_D’ Amico@ed.gov 

Statistical Procedures 

Three types of statistical procedures were employed in this report: testing differences be- 
tween means (or proportions), testing for linear trends, and adjustment of means after controlling 
for covariation among several variables. Each procedure is described below. 

Differences Between Means 

Most descriptive comparisons in this report were tested using Student’s t statistic. Statisti- 
cal significance was determined by calculating t values for differences between pairs of means or 
proportions and comparing these with published values of t for two-tailed hypothesis testing, 
using a 5 percent probability of a Type I error (a significance level of .05).26 

Student’s t values may be computed to test the difference between estimates with the fol- 
lowing formula: 



_ El - E2 
•^sej+sel 

where E\ and E 2 are the estimates to be compared and se\ and se 2 are their corresponding stan- 
dard errors. Note that this formula is valid only for independent estimates. When the estimates 
are not independent (for example, when comparing a total percentage with that for a subgroup 
that is included in the total), a covariance term must be added to the denominator of the r-test 



°A Type I error occurs when one erroneously concludes that a difference observed in a sample reflects a true difference in the 
population from which the sample was drawn. 
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formula. When comparing the estimate for a total with that of a subgroup, the following formula 
was used; 



t = 



Esub ~ Eu 



•yl SCsubA' SCioi '^P S€sub 



( 2 ) 



where p is the proportion of the total contained in the subgroup. 

There are hazards in reporting statistical tests for each comparison. First, comparisons 
based on large t statistics may appear to merit special attention. This can be misleading, since the 
magnitude of the t statistic is related not only to the observed differences in means or percentages 
but also to the number of students in the specific categories used for comparison. Hence, a small 
difference compared across a large number of students would produce a large t statistic. 

A second hazard in reporting statistical tests for each comparison occurs when making 
multiple comparisons among categories of an independent variable. For example, when making 
paired comparisons among different levels of income, the probability of a Type I error for these 
comparisons taken as a group is larger than the probability for a single comparison. When more 
than one difference between groups of related characteristics or “families” are tested for statisti- 
cal significance, one must apply a standard that assures a level of significance for all of those 
comparisons taken together. 

Comparisons were made in this report only when p<.05/k for a particular pairwise compari- 
son, where that comparison was one of k tests within a family. This guarantees both that the in- 
dividual comparison would have p<.05 and that for k comparisons within a family of possible 
comparisons, the significance level for all the comparisons will sum to p<.05.27 



For example, when comparing males and females only one comparison is possible. In this 
family, k=l, and there is no need to adjust the significance level. When students are divided into 
five racial-ethnic groups and all possible comparisons are made, then k=lO and the significance 
level for each test within this family of comparisons must be p<.05/10, or p<.005. The formula 
for calculating family size (k) is as follows: 



2 



(3) 



^^The standard that p<.05/k for each comparison is more stringent than the criterion that the significance level of the comparisons 
should sum to p<.05. For tables showing the t statistic required to ensure that p<.05/k for a particular family size and degrees of 
freedom^ see Olive Jean Dunn, “Multiple Comparisons Among Means/* Journal of the American Statistical Association 56 
(1961): 52-64. 
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where j is the number of categories for the variable being tested. For example, in the case of a 
variable with five categories such as race-ethnicity, one substitutes 5 for j in equation 3: 

. = ^(^=10 
2 



Linear Trends 

Some comparisons across categories of an ordered variable with three or more levels in- 
volved a test for a linear trend across all categories, rather than a series of tests between pairs of 
categories. Two procedures were used to test the statistical significance of an apparent linear 
trend, depending upon whether the estimates being examined were proportions (such as percent- 
ages) or averages. 

Linear trends in proportions or averages. When proportions or averages of a continuous 
variable were examined relative to a variable with ordered categories. Analysis of Variance 
(ANOVA) was used to test for a linear relationship between the two variables. To do this, 
ANOVA models included orthogonal linear contrasts corresponding to successive levels of the 
independent variable. The squares of the Taylorized standard errors (that is, standard errors that 
were calculated by the Taylor series method), the variance between the means, and the un- 
weighted sample sizes were used to partition total sum of squares into within- and between- 
group sums of squares. These were used to create mean squares for the within- and between- 
group variance components and their corresponding F statistics, which were then compared with 
published values of F for a significance level of .05.28 Significant values of both the overall F 
and the F associated with the linear contrast term were required as evidence of a linear relation- 
ship between the two variables. Means and Taylorized standard errors were calculated by the 
DAS. Unweighted sample sizes are not available from the DAS and were provided by NCES. 

Adjustment of Means to Control for Covariation Among Several Variables 

Tabular results are limited by sample size when attempting to control for the multiplicity of 
factors that may account for the variation observed between two variables. For example, when 
examining the average number of credits completed, it is impossible to know to what extent the 
observed variation is due to socioeconomic status (SES) differences and to what extent it is due 
to differences in other factors related to SES, such as type of institution attended, intensity of en- 
rollment, and so on. However, if a nested table were produced showing SES within type of in- 
stitution attended, within enrollment intensity, the cell sizes would be too small to identify the 

28More information about ANOVA and significance testing using the F statistic can be found in any standard textbook on statis- 
tical methods in the social and behavioral sciences. 




J u' . 



65 



54 



Appendix B — T echnical Notes and Methodology 



patterns. When the sample size becomes too small to support controls for another level of varia- 
tion, one must use other methods to take such variation into account. 

To overcome this difficulty, multiple linear regression was used to obtain means that were 
adjusted for covariation among a list of control variables. ^9 Adjusted means for subgroups were 
obtained by regressing the dependent variable on a set of student and enrollment characteristics. 
Substituting ones or zeros for the subgroup characteristic(s) of interest and the mean proportions 
for the other variables results in an estimate of the adjusted mean for the specified subgroup, 
holding all other variables constant. For example, consider a hypothetical case in which two 
variables, race-ethnicity and SES, are used to describe an outcome, Y (such as the number of 
credits completed). The variables race-ethnicity and SES are recoded into dummy variables: 



Race-ethnicity R 

Black, non-Hispanic 1 

All others 0 

and 

SES S 

Low SES 1 

All others 0 



The following regression equation is then estimated from the correlation matrix output from the 
DAS: 



Y = a + b]/? + b2>5 (4) 

To estimate the adjusted mean for any subgroup evaluated at the mean of all other variables, one 
substitutes the appropriate values for that subgroup’s dummy variables (1 or 0) and the mean for 
the dummy variable(s) representing all other subgroups. For example, suppose we had a case 
where Y was being described by race-ethnicity (/?) and SES (5), coded as shown above, and the 
means for R and S are as follows: 



^^For more information about multiple regression, see Michael S. Lewis-Beck, Applied Regression: An Introduction, Vol. 22 
(Beverly Hills, CA: Sage Publications, Inc., 1980) or William D. Berry and Stanley Feldman, Multiple Regression in Practice, 
Vol. 50 (Beverly Hills, CA: Sage Publications, Inc., 1987). 
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Variable Mean 



R 0.109 

S 0.282 



Suppose the regression equation results in: 



Y = 51.07 + (17.63)/? + (-22.46)5 



To estimate the adjusted value for black students, one substitutes the appropriate parameter val- 
ues into equation 4. 



Variable Parameter Value 



a 51.07 — 

R 17.63 1.000 

5 -22.46 0.282 



This results in: 

A 

Y = 51.07 -H (17.63)(1) + (-22.46)(0.282) = 62.37 

In this case, the estimated mean number of credits is 62.37 and represents the expected outcome 
for non-Hispanic black students who resemble the average student with respect to the other vari- 
ables in the model (in this example, SES). 

It is relatively straightforward to produce a multivariate model using the DAS, since one of 
the DAS output options is a correlation matrix, computed using pairwise missing values and 
properly weighted to account for the complex sample design and for nonresponse. This matrix 
can be used by most statistical software packages as the input data for least-squares regression. 
That is the approach used for this report, with an additional adjustment to incorporate the com- 
plex sample design into the statistical significance tests of the parameter estimates (described 
below). 

Most statistical software packages assume simple random sampling when computing stan- 
dard errors of parameter estimates. Because of NELS:88/94 and BPS:89/90’s complex sample 
design, this assumption is incorrect. A better approximation of the standard errors is to multiply 



^^Although the DAS enables the analyst to estimate regression models, it also limits the choices available. Analysts who wish to 
use other than pairwise treatment of missing values or to estimate other models can apply for a restricted data license from 



NCES. 




O' V 



67 



56 



Appendix B — T echnical Notes and Methodology 



each standard error by the average design effect of the dependent variable (DEFT), 3* where the 
DEFT is the ratio of the true standard error to the standard error computed under the assumption 
of simple random sampling. It is calculated by the DAS and produced with the correlation ma- 
trix. 



^*The adjustment procedure and its limitations are described in C.J. Skinner, D. Holt, and T.M.F. Smith, eds.. Analysis of Com- 
plex Surveys (New York: John Wiley & Sons, 1989). 
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Overview 

The National Education Longitudinal Study (NELS:88/94) data set is a rich source of data 
on the relationship between high school experiences and early postsecondary outcomes. How- 
ever, the last collection of NELS: 88/94 captured the sample members when most of theni were 
only 2 years out of high school — ^much too early to be able to say much about their eventual 
postsecondary attainment and persistence. Conversely, the Beginning Postsecondary Students 
Longitudinal Study and follow-up (BPS:90/94) is a rich source of data on the characteristics of 
beginning postsecondary students’ educational attainment and persistence but has little informa- 
tion about participants’ high school experiences. 

These two data sets share a fairly extensive set of common variables describing the back- 
ground characteristics of sample members. These characteristics were used in this project to link 
students in each of the data sets, thereby creating a “synthetic cohort,” to provide values for vari- 
ables missing in the NELS: 88/94 data set. In a sense, the data on long-term postsecondary out- 
comes in NELS: 88/94 (due to the censoring of the data 2 years after high school graduation) 
were treated as a huge missing data problem. These missing data were imputed for the 
NELS: 88/94 students using multiply imputation methods developed by Rubin, and later ex- 
panded upon by Little and Rubin. ^2 

The imputation was performed through a hotdeck procedure implemented through the SAS 
programming language. In this procedure a NELS:88/94 student was matched with a BPS:90/94 
student, and the data on postsecondary outcomes for the BPS:90/94 student were assigned to the 
NELS: 88/94 student. This procedure was performed 36 times to capture the measurement error 
associated with the imputations. A disturbance term was added to the NELS:88/94 sample 
weights to reflect this added source of error in the data. This disturbance term inflated the weight 
(BPSLNKWT) so that the variance component of the estimates associated with the size of the 
weight was also inflated — accurately inflating the total variance of the estimate to account for 
measurement error due to imputation. 



^2d.B. Rubin, Multiple Imputation for Nonresponse in Surveys (New York; John Wiley and Sons, 1987); R.J.A. Little and D.B. 
Rubin, Statistical Analysis With Missing Data (New York; John Wiley and Sons, 1987). 
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Each of the steps in the creation of this data set is discussed in the following sections. 



Selection of Sample and Variables for Analysis 

Sample Selection 

In order to make inferences about the students in each data set, it was important that the re- 
spondents drawn from the NELS:88/94 and BPS;90/94 data sets be as close as possible in terms 
of certain student characteristics, such as age, high school graduation status, and initial Post- 
secondary Education (PSE) enrollment. For instance, since BPS;90/94 was a cohort of first-time 
beginning postsecondary students, sample members in the NELS;88/94 were drawn who enrolled 
in some type of postsecondary education within 2 years after high school graduation. Similarly, 
the sample of BPS:90/94 respondents had to be restricted to sample members whose age was 22 
years or younger as of December 31, 1989, because the age range for sample members in 
NELS: 88/94 had an upper limit of 22 years in 1992, the year that most of them graduated from 
high school. In addition, since this new data set was based on high school graduates, students in 
both data sets who did not have a high school diploma. General Educational Development 
(GED), certification, or equivalent were excluded. Table Cl lists several selection criteria used to 
select students from the NELS: 8 8/94 and BPS: 90/94. Application of these criteria resulted in a 
total of 5,523 first-time beginning postsecondary sample members from BPS:90/94 and 7,563 
high school graduates from NELS: 88/94, accounting for 72.4 and 50.7 percent, respectively, of 
original samples. 

Table C2 presents the comparison of the selected students from the two data sets based on 
students’ sex, race-ethnicity, and age. The weighted proportions of male and female students in 
the two samples were quite comparable: in each data set, male students account for about 48 per- 
cent and female students, 52 percent. When examined by race-ethnicity, the proportion of mi- 
nority students appears higher in the NELS:88/94 sample than in the BPS:90/94 sample: 20 
percent in BPS:90/94 compared with 25 percent in the NELS:88/94. This may reflect the fact 
that minority enrollment in higher education increased significantly between 1990 and 1992. Ac- 
cording to The Condition of Education 1996 (p. 144), minority students in U.S. colleges and uni- 
versities increased from 20 percent in 1990 to 22 percent in 1992 (recall that the BPS:90/94 
sample was based on the 1989-90 NPSAS first-time beginning postsecondary education students 
and the NELS: 88/94 sample members were high school graduates in 1992). 

When examined by age, BPS:90/94 students were older than NELS:88/94 students 
were — about 12 percent of BPS:90/94 students were age 20 or older, compared with 2 percent of 
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Appendix C — NELS/BPS Merged Data File Methodology Report 



Table C2 — Comparison (in percentage distribution by columns) of selected samples of BPS:90/94 and 
NELS:88/94 based on sex, race-ethnicity, and age 



Variable 


BPS:90/94 

(N=5,523)‘ 


NELS:88/94 

(N=9,264)’ 


Total 


100.0 


100.0 


Sex 


Male 


47.7 


48.3 


Female 


52.3 


51.7 


Race-ethnicity 


White, non-Hispanic 


79.8 


74.8 


Black, non-Hispanic 


8.8 


10.1 


Hispanic 


6.9 


9.2 


Asian/Pacific Islander 


3.6 


5.0 


American Indian/Alaskan Native 


0.8 


0.8 


Age’ 


17 years or younger 


3.5 


1.2 


1 8 years 


64.2 


71.0 


19 years 


20.8 


25.7 


20-22 years 


11.5 


2.1 


‘Weighted by BPS94AWT. 
Veighted by F3QWT. 


For NELS. 88/94 students, age was as of 1992 when students graduated from high school and could start their postsecondary 


education. For BPS:90/94 students, age was as of 1989, the year that students started their first-time postsecondary education. 



NOTE: Detail may not sum to 100.0 due to rounding. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, 1990 Beginning Postsecondary Students 
Longitudinal Study Second Follow-up and National Education Longitudinal Study of 1988 Third Follow-up. 



NELS:88/94 students. This difference was due to the different base samples targeted by the two 
surveys. BPS:90/94 started with a cohort of individuals beginning their postsecondary studies, 
regardless of when they completed high school, whereas NELS: 88/94 targeted 8th-grade students 
and followed them through high school and beyond. There are several implications for “older” 
students in BPS:90/94 — they may have delayed their postsecondary education enrollment or 
could have been held back one or more grades in elementary, middle, or high school. “Delaying” 
and “holding back” will have different impacts on the estimated values of high school achieve- 
ment or postsecondary education attainment; therefore, close attention was paid to these “older” 
students in the imputation process. 



Appendix C — NELS/BPS Merged Data File Methodology Report 



Variable Selection 

Table C3 presents the variables selected for linking NELS:88/94 and BPS:90/94 respon- 
dents. Variables were selected to assure that they were defined in the same manner or could be 
recoded so that they were defined as similarly as possible between the data sets. The list includes 
basic demographic characteristics of students such as sex, race-ethnicity, and age, as well as 
some socioeconomic measures such as parental education, occupation, and various items the 
family owns. Previous research indicates the importance of students’ educational expectations 
relative to students’ achievement and postsecondary outcomes; therefore, this variable was in- 
cluded in the list. In addition, some characteristics that describe students’ postsecondary experi- 
ences were considered, such as immediate versus delayed enrollment, part-time versus full-time 
enrollment, type and level of institution, and major field of study for earliest postsecondary edu- 
cation enrollment. 

Table C4 displays the imputation matrix for this study. The primary imputation cells (col- 
umns) were created from the three variables that describe students’ early postsecondary experi- 
ences: type and level of first postsecondary education institution, timing of postsecondary 
education entry (immediate or delayed), and attendance status (part-time versus full-time enroll- 
ment). In order to make the type and level of the postsecondary education institution comparable 
between the two data sets, we combined the categories so that the variable had three categories 
common to NELS:88/94 and BPS:90/94: l=public 4-year, 2=private not-for-profit 4-year, and 
3=public 2-year. There are a total of 12 primary imputation cells. Note that the number of 
BPS:90/94 students in each cell for private postsecondary education institutions is larger than the 
number of NELS:88/94 students (see rows 1 and 11 in table C4). However, in each cell for pub- 
lic postsecondary education institutions, the number of BPS:90/94 students is smaller than the 
number of NELS:88/94 students. 

Within each of the primary imputation cells, a crosstabulation was performed based on stu- 
dents’ sex, race-ethnicity, and socioeconomic status (SES). First, the SES variable was catego- 
rized into three levels: low SES (less than 25th percentile), middle SES (between 25th and 75th 
percentiles), and high SES (more than 75th percentile). The results of the crosstabulation indi- 
cated that there were too many small cells in the BPS:90/94 data to be managed in later matching 
procedures. Thus, the race variable was collapsed into two categories: white or Asian, Pacific 
Islander, and non- Asian, Pacific Islander minority (to collapse within the primary cell rather than 
across primary cells). Again, small cells were a problem. SES was therefore further collapsed 
into two categories split at the 50th percentile: low to middle SES (less than 50th percentile) and 
middle to high SES (equal to or more than 50th percentile). 
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Table C3 — Common variables selected from BPS:90/94 and NELS:88/94 
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Father’s or mother’s occupation DADOC/MOMOC (marketing, administrative, support, BYS7OCC/F1N7B/F2N7/BYS40OCC/F1N5B/F2N5 

service, executive, postsecondary instructor, other education, (clerical, craftsperson, farmer/farm manager, homeworker, 
engineer/architect, scientist, computer scientist, social/ laborer, manager/administrative, military, operative, 

recreation, doctor/dentist, other medical, technical, lawyer, professional I, professional II, protective service, 
blue collar, writer/artist) sales, school teacher, service, technical, never worked) 



Table C3 — Common variables selected from BPS:90/94 and N ELS:88/94 — Conti nued 

^^■able BPS:90/94 NELS:88/94 
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Appendix C — NELS/BPS Merged Data File Methodology Report 



Validation of the Imputation Procedure 

Before imputing all of the postsecondary outcomes for the NELS: 88/94 respondents, an 
analysis was run to evaluate the validity of the proposed imputation procedure. Since both 
NELS:88/94 and BPS:90/94 had persistence and attainment information on their respondents ap- 
proximately 2 years after entering postsecondary education, this information was used to test the 
accuracy of the proposed imputation procedure for all postsecondary outcomes. Table C5 pres- 
ents the definition of 2-year persistence and attainment rates used in NELS: 88/94 and 
BPS:90/94. Substantial effort was made to construct these variables so that they had minimal 
missing data and were comparable between the two data sets. For comparability, students who 
persisted were defined as those who enrolled in postsecondary education as of May 1991 (for 
BPS:90/94 students) or as of May 1994 (for NELS:88/94 students) or those who did not enroll 
but attained some type of degree. For the attainment outcome, two measures were examined: 1) 
whether a student attained some type of degree including a certificate or associate’s degree 
within the first 2 years of postsecondary education; and 2) whether a student attained an associ- 
ate’s degree. . 

Table C6 presents actual persistence and attainment rates for both BPS:90/94 and 
NELS: 8 8/94 students and proportions of students according to their timing of postsecondary 
education enrollment, enrollment status, sex, race-ethnicity, and SES in three types of post- 
secondary education institutions. While the overall persistence rates appear comparable across 
the two data sets, the attainment rates are not — compared with BPS:90/94 students, NELS:88/94 
students appear to have higher overall attainment rates in 4-year public and private institutions, 
but have a lower rate for attaining associate’s degrees in all three types of institutions. It is plau- 
sible that the different make-up of students in the two samples contributes to these differences, 
for the percentages of students according to their postsecondary education experiences and indi- 
vidual characteristics are not equivalent. For example, the NELS: 88/94 sample appears to in- 
clude more minority and low-SES students than the BPS: 90/94 sample, whereas the BPS: 90/94 
sample appears to consist of more students who delayed their postsecondary education and en- 
rolled in postsecondary education on a part-time basis than the NELS: 88/94 sample. 
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Table C6 — Comparison (by columns) of key variables in BPS:90/94 and NELS:88/94 selected for the imputation 

Public 4-year Private, not-for-profit, 4-year Public 2-year 

Variable BPS:90/94' NELS:88/94^ BPS:90/94‘ NELS:88/94^ BPS:90/94' NELS:88/94- 
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Appendix C — NELS/BPS Merged Data File Methodology Report 



Imputation of 2-Year Persistence and Attainment Rates 

The imputation was performed through a sequential hotdeck procedure. First, the 
BPS:90/94 and NELS: 88/94 data were combined into one data set in which 2-year persistence 
and attainment indicators (l=Yes and 0=No) were retained for BPS:90/94 sample members but 
missing for all NELS: 8 8/94 sample members. Next, one imputation was performed in each im- 
putation cell defined in table 4. Finally, imputed and actual rates were compared. Table C7 pres- 
ents the results of this imputation. 

Overall, the imputation for the 2-year persistence rate appears to be more successful than 
the imputation for the 2-year attainment rate. While the imputed persistence rates appear to be 
comparable to the actual rates in NELS, particularly for students enrolling in 4-year public and 
private not-for-profit institutions, the imputed attainment rates are mostly lower than the actual 
rates across all three types of institutions and the imputed rates for attaining an associate’s degree 
are mostly higher than the actual rates. Recall that there are many cells collapsed in “delayed 
postsecondary education” and “part-time attendance status” (see table C4). Therefore, only full- 
time students who enrolled in postsecondary education immediately after high school graduation 
were selected (no cells are collapsed for these students) and one hotdeck imputation for these 
students was performed. The results are presented in table C8. Again, the conclusions are similar 
to those derived from table Cl. While the imputed persistence rates, whether for totals or broken 
down by sex, race-ethnicity, and SES, appear close to the actual rates across all three types of 
institutions, the gap between imputed and actual attainment rates persists and appears large. 

However, the definition for attainment in the two data sets is not strictly comparable. In the 
NELS:88/94, attainment is a student’s self-report of the highest postsecondary degree attained 
and certificates and licenses are included. In the BPS:90/94, attainment is a derived and retro- 
spective variable indicating whether a student attained some type of degree at the end of aca- 
demic year 1990-91, and licenses are not included. Therefore, while the imputed 2-year 
attainment rates are not consistent with actual reported 2-year attainment rates, the overall results 
of the imputation of attainment and persistence were judged to be of reasonable quality. 
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Appendix C — NELS/BPS Merged Data File Methodology Report 



Full Imputation 

Since the results above looked reasonable, other persistence and attainment variables were 
imputed for NELS:88/94 students. A partial list of imputed variables is included below: 

1) Enrolled anytime in postsecondary education during academic years 1993 to 1996 

Enrolled 
Not enrolled 

2) Month-by-month enrollment status July 1992 to June 1997 

Full-time 

More than half time, less than full-time 
Less than half time or unknown 

3) Ever attended graduate school 

4) Ever attended a 4-year institution 

5) Ever attended a private for-profit less-than-4-year institution 

6) Ever attended a public 2-year institution 

7) First degree attained at first institution 

Certificate 
Associate’s degree 
Bachelor’s degree 

8) Highest undergraduate degree 1997 

None 
Certificate 
Associate’s degree 
Bachelor’s degree 

9) Level and control where ended first attainment 

10) Level and control where received highest degree 

11) gLevel and control, last institution 

Public 4-year 

Private, not-for-profit, 4-year 
Private, for-profit, 4-year 
Public 2- to 3-year 
Private, not-for-profit 2- to 3-year 
Private, for-profit, 2- to 3-year 
Public less-than-2-year 
Private, not-for-profit, less-than-2-year 
Private, for-profit, less-than-2-year 

12) Number of institutions attended through first attainment 

13) N umber of institutions attended during 1 992-97 

14) Persistence and attainment during academic years 1992 to 1996 

Attained certificate 
Attained associate’s degree 
Attained bachelor’s degree 
Persisted to academic year 1993-94 
Downward/delayed transfer 
Stopout 

Left without return 
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While it is well known that hotdeck imputation is a convenient method for providing a 
fully allocated data set that produces unbiased estimates, it is also known that using just one im- 
putation systematically underestimates the variance of these estimates. To correct for this under- 
estimation, Rubin has suggested using a multiply imputed data set.33 That is, the missing data 
imputation is repeated a number of times (M). Let 0 be some parameter of interest and w, , 

/= 1 , . . . .M are the estimated 0 and their variances for M imputations. Then the combined estimate 
is: 






= yA 
hhd 



( 1 . 1 ) 



The variance associated with this estimate is the combination of the mean of the within im- 
putation variances (or, alternatively, the variance associated with the first imputation): 



Wm 




and the between imputation variance: 



Bm — 



M-1 



The total variance (sampling and measurement) of is: 



Tm = Wm + 



M-l-1 

M 



Bm 



(1.2) 



(1.3) 



(1.4) 



or for the sake of simplicity: 



M-i-1 

Tm=Wx + -—Bm (1.5) 

M 

In this study, this procedure was used to capture the uncertainty due to measurement error 
related to imputing data. For example, using the imputation cells defined above in table C4, 
NELS:88/94 sample member #1 was matched with a BPS:90/94 sample member within the ap- 
propriate imputation cell. The BPS:90/94 sample member’s postsecondary outcome variables 
were then attached to NELS:88/94 sample member #l’s record. The BPS:90/94 member was re- 
turned to the matrix and another random draw for NELS: 88/94 sample member #1 was taken 
from the imputation matrix. This was repeated 36 times. The variance between the 36 imputed 



33d. B. Rubin, Multiple Imputation for Nonresponse in Surveys (New York: John Wiley and Sons, 1987), p. 15. 
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Appendix C — NELS/BPS Merged Data File Methodology Report 



values for NELS: 88/94 sample member #1 represented the measurement error variance for sam- 
ple member #1 due to imputing his/her postsecondary outcomes. Separate imputations were con- 
ducted for NELS:88/94 students who first enrolled in private 4-year institutions and public 4- 
year institutions. 

A summary of the results of these imputations is presented in table C9. (Some of the im- 
puted variables are not shown in this table for the sake of simplicity.) 

The second and sixth columns of this table show the mean of the average of the 36 values 
imputed for respondents who first enrolled in a private 4-year college or university and in a pub- 
lic 4-year college or university. The third and seventh columns show the variance between im- 
putations. The fourth and fifth show the standard error due to sampling (assuming simple random 
sampling), and the sampling variance (assuming simple random sampling) for the first mean for 
those attending private 4-year college or university. The eighth and ninth columns show the 
analogous data for public 4-year students. The complex standard errors (in columns 11 and 12) 
were calculated using the computer program SUDAAN, which takes into account the complex 
nature of the sampling design of NELS: 88/94. The total standard errors (the last two columns) 
are based on equation 1.5 above. The last row of the table show the mean across these columns. 
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Table C9 — Summary of results from full imputation of BPS:90/94 variables — Continued 

Private, not-for-profit 4-year Public 4-year Complex S.E. Total S.E. 

Selected Mean of 36 Var. across S.E. for the Variance Mean of 36 Var. across S.E. for the Variance Private, Private, 

BPS:90/94 means of 36 means of mean of first for the first means of 36 means of mean of first for the first not-for-profit Public not-for-profit Public 

variable imputation imputation imputation mean imputation imputation imputation mean 4-year 4-year 4-year 4-year 
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Appendix C — NELS/BPS Merged Data File Methodology Report 



Adjustment for Measurement Error 

In most applications of Rubin and Little’s multiply imputation scheme, a completely im- 
puted data set is created M number of times (in most applications at NCES, M=5). Separate 
analyses are then run on these M data sets and then the variances of the estimates are calculated 
as in equation 1.5. In the present application of their multiply imputation scheme, the measure- 
ment error due to imputation was to be incorporated into an automated Data Analysis System 
(DAS). Therefore, a more computationally convenient method of incorporating the measurement 
error due to multiple imputations was needed. Calculating separate analyses based on M com- 
plete data sets was considered too computationally laborious and time intensive for the DAS. In- 
stead, the task was accomplished by estimating the total variances as shown in table 8 above and 
calculating the resulting design effects due to complex sampling and due to measurement error. 
These parameters were then used to inflate the variance of the weights so that the resulting stan- 
dard errors were inflated to reflect the added source of error due to multiple imputations. That is, 
a new weight was created: 



in which the NELS:88/94 third follow-up weight was “disturbed” by a factor normally distrib- 
uted with a mean of 0 and a variance of E. E is defined as: 



where deff„,/„ is the ratio of the design effect due to measurement error (ws) to the design effect 
due to complex sampling (cs) and</cjQT„is the design effect due to complex sampling. Separate E 
and D parameters were calculated for students who first enrolled in 4-year private institutions 
and students who first enrolled in 4-year public institutions. 

This disturbance term E was randomly assigned to the third follow-up weight for each case 
in sequence. For example, respondent S, was assigned a disturbance term to his/her F3QWT as 
in equation 1.6 above. After assigning the disturbance term, if S/s weight was less than 1, Sj was 
assigned a value of 1 for his/her weight (to avoid extremely low or negative weights) and S 2 had 
a disturbance term assigned to his/her F3QWT that was the residual of disturbance term of Sj. If 



BPSLNKWT = F3QWT + N(0,E) 




( 1 . 6 ) 



and D is defined as: 




(1.7) 
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Appendix C — NELS/BPS Merged Data File Methodology Report 



S, had a new weight greater than 1, then Sj had a disturbance term assigned to his/her F3QWT as 
in 1.6 above. 

Thus, new weights were calculated for each individual whose variance reflected the meas- 
urement error due to imputation. However, given the lower bound of unity for the weights, the 
dismrbance term was ultimately not randomly distributed with a mean of zero. The new weights 
were therefore adjusted to sum back to the old weights by a simple post-stratification to the sum 
of the weights for each postsecondary institution type (4-year private, not-for-profit and 4-year 
public). 

To check on the results of these calculations, weights were created based on the values of 
D and £■ for just the first variable in the list in table 9 (degall2). In this way one could see how 
well the calculation of variance based on disturbing the weights compared to the actual variances 
calculated from the spreadsheet. Using BPSLNKWT resulted in standard errors very close to 
what was estimated in the spreadsheet. For private, not-for-profit 4-year institutions SUDAAN 
produced a standard error of 0.0618 versus 0.0612 in the spreadsheet, and for public 4-year in- 
stitutions, 0.0392 versus 0.0429. 
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